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Preface 


This book is intended as a text for a first-year physical-chemistry or chemical- 
physics graduate course in quantum mechanics. Emphasis is placed on a 
rigorous mathematical presentation of the principles of quantum mechanics 
with applications serving as illustrations of the basic theory. The material is 
normally covered in the first semester of a two-term sequence and is based on 
the graduate course that I have taught from time to time at the University of 
Pennsylvania. The book may also be used for independent study and as a 
reference throughout and beyond the student’s academic program. 

The first two chapters serve as an introduction to quantum theory. It is 
assumed that the student has already been exposed to elementary quantum 
mechanics and to the historical events that led to its development in an 
undergraduate physical chemistry course or in a course on atomic physics. 
Accordingly, the historical development of quantum theory is not covered. To 
serve as a rationale for the postulates of quantum theory, Chapter 1 discusses 
wave motion and wave packets and then relates particle motion to wave motion. 
In Chapter 2 the time-dependent and time-independent Schrodinger equations 
are introduced along with a discussion of wave functions for particles in a 
potential field. Some instructors may wish to omit the first or both of these 
chapters or to present abbreviated versions. 

Chapter 3 is the heart of the book. It presents the postulates of quantum 
mechanics and the mathematics required for understanding and applying the 
postulates. This chapter stands on its own and does not require the student to 
have read Chapters 1 and 2, although some previous knowledge of quantum 
mechanics from an undergraduate course is highly desirable. 

Chapters 4, 5, and 6 discuss basic applications of importance to chemists. In 
all cases the eigenfunctions and eigenvalues are obtained by means of raising 
and lowering operators. There are several advantages to using this ladder 
operator technique over the older procedure of solving a second-order differ- 
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IX 


ential equation by the series solution method. Ladder operators provide practice 
for the student in operations that are used in more advanced quantum theory 
and in advanced statistical mechanics. Moreover, they yield the eigenvalues 
and eigenfunctions more simply and more directly without the need to 
introduce generating functions and recursion relations and to consider asymp¬ 
totic behavior and convergence. Although there is no need to invoke Hermite, 
Legendre, and Laguerre polynomials when using ladder operators, these func¬ 
tions are nevertheless introduced in the body of the chapters and their proper¬ 
ties are discussed in the appendices. For traditionalists, the series-solution 
method is presented in an appendix. 

Chapters 7 and 8 discuss spin and identical particles, respectively, and each 
chapter introduces an additional postulate. The treatment in Chapter 7 is 
limited to spin one-half particles, since these are the particles of interest to 
chemists. Chapter 8 provides the link between quantum mechanics and 
statistical mechanics. To emphasize that link, the free-electron gas and Bose- 
Einstein condensation are discussed. Chapter 9 presents two approximation 
procedures, the variation method and perturbation theory, while Chapter 10 
treats molecular structure and nuclear motion. 

The first-year graduate course in quantum mechanics is used in many 
chemistry graduate programs as a vehicle for teaching mathematical analysis. 
For this reason, this book treats mathematical topics in considerable detail, 
both in the main text and especially in the appendices. The appendices on 
Fourier series and the Fourier integral, the Dirac delta function, and matrices 
discuss these topics independently of their application to quantum mechanics. 
Moreover, the discussions of Hermite, Legendre, associated Legendre, La¬ 
guerre, and associated Laguerre polynomials in Appendices D, E, and F are 
more comprehensive than the minimum needed for understanding the main 
text. The intent is to make the book useful as a reference as well as a text. 

I should like to thank Corpus Christi College, Cambridge for a Visiting 
Fellowship, during which part of this book was written. I also thank Simon 
Capelin, Jo Clegg, Miranda Fyfe, and Peter Waterhouse of the Cambridge 
University Press for their efforts in producing this book. 


Donald D. Fitts 
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The wave function 


Quantum mechanics is a theory to explain and predict the behavior of particles 
such as electrons, protons, neutrons, atomic nuclei, atoms, and molecules, as 
well as the photon-the particle associated with electromagnetic radiation or 
light. From quantum theory we obtain the laws of chemistry as well as 
explanations for the properties of materials, such as crystals, semiconductors, 
superconductors, and superfluids. Applications of quantum behavior give us 
transistors, computer chips, lasers, and masers. The relatively new field of 
molecular biology, which leads to our better understanding of biological 
structures and life processes, derives from quantum considerations. Thus, 
quantum behavior encompasses a large fraction of modern science and tech¬ 
nology. 

Quantum theory was developed during the first half of the twentieth century 
through the efforts of many scientists. In 1926, E. Schrodinger interjected wave 
mechanics into the array of ideas, equations, explanations, and theories that 
were prevalent at the time to explain the growing accumulation of observations 
of quantum phenomena. His theory introduced the wave function and the 
differential wave equation that it obeys. Schrodinger’s wave mechanics is now 
the backbone of our current conceptional understanding and our mathematical 
procedures for the study of quantum phenomena. 

Our presentation of the basic principles of quantum mechanics is contained 
in the first three chapters. Chapter 1 begins with a treatment of plane waves 
and wave packets, which serves as background material for the subsequent 
discussion of the wave function for a free particle. Several experiments, which 
lead to a physical interpretation of the wave function, are also described. In 
Chapter 2, the Schrodinger differential wave equation is introduced and the 
wave function concept is extended to include particles in an external potential 
field. The formal mathematical postulates of quantum theory are presented in 
Chapter 3. 


1 
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The wave function 


1.1 Wave motion 


Plane wave 

A simple stationary harmonic wave can be represented by the equation 


ip(x) = cos 


2jtx 

~T~ 


and is illustrated by the solid curve in Figure 1.1. The distance X between peaks 
(or between troughs) is called the wavelength of the harmonic wave. The value 
of ip(x) for any given value of x is called the amplitude of the wave at that 
point. In this case the amplitude ranges from +1 to — 1. If the harmonic wave is 
A cos{2jtx/X), where A is a constant, then the amplitude ranges from +A to 
—A. The values of x where the wave crosses the x-axis, i.e., where >p(x) equals 
zero, are the nodes of rp(x). 

If the wave moves without distortion in the positive x-direction by an amount 
xo, it becomes the dashed curve in Figure 1.1. Since the value of ip(x) at any 
point x on the new (dashed) curve corresponds to the value of ip(x) at point 
x — x 0 on the original (solid) curve, the equation for the new curve is 

2jt 

ip{x) = cos — (x - Xo) 

A 


If the harmonic wave moves in time at a constant velocity v, then we have the 
relation x 0 = vt, where t is the elapsed time (in seconds), and tp(x) becomes 

2 71 

ip(x, t) = cos — (x — vt) 

A 

Suppose that in one second, v cycles of the harmonic wave pass a fixed point 
on the x-axis. The quantity v is called the frequency of the wave. The velocity 



Figure 1.1 A stationary harmonic wave. The dashed curve shows the displacement of 
the harmonic wave by x 0 . 
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v of the wave is then the product of v cycles per second and X, the length of 
each cycle 

v = vX 


and ip(x, t) may be written as 

ip(x, t) 


cos2?r 



It is convenient to introduce the wave number k, defined as 



and the angular frequency cu, defined as 

co = 2jiv ( 1 . 2 ) 

Thus, the velocity v becomes u = cu/k and the wave >p(x, t) takes the form 

ip{x, t ) = cos(Ax — cut) 

The harmonic wave may also be described by the sine function 

>p(x, t) = sin(Ax — cut) 

The representation of rp(x, t) by the sine function is completely equivalent to 
the cosine-function representation; the only difference is a shift by 2/4 in the 
value of x when t = 0. Moreover, any linear combination of sine and cosine 
representations is also an equivalent description of the simple harmonic wave. 
The most general representation of the harmonic wave is the complex function 

rp(x, t) = cos (kx — cut) + i sin(Ax — cut) = (1.3) 

where i equals \/—I and equation (A.31) from Appendix A has been intro¬ 
duced. The real part, cos {kx — cut), and the imaginary part, sin(Ax — cut), of the 
complex wave, (1.3), may be readily obtained by the relations 

Re [e l(far-c °d] = cos {kx — cut) = ^ [f(x, t) + xp*{x, /)] 

Im = sin(Ax — cut) = ^ [ ip(x, t) — ip*(x, ?)] 

where >p*(x, t) is the complex conjugate of ip(x, t) 

ip*(x, t) = cos(A-x — cut) — i sin(Ax — cut) = e _l(foc_ ®d 

The function ip*(x, t) also represents a harmonic wave moving in the positive 
x-direction. 

The functions exp[i(Ax + cut)) and exp[—i {kx + cut)\ represent harmonic 
waves moving in the negative x-direction. The quantity (kx + out) is equal to 
k(x + vt) or k(x + x 0 ). After an elapsed time t, the value of the shifted 

harmonic wave at any point x corresponds to the value at the point x + xo at 

time t = 0. Thus, the harmonic wave has moved in the negative x-direction. 
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The wave function 


The moving harmonic wave ip(x, t) in equation (1.3) is also known as a 
plane wave. The quantity (kx — cot) is called the phase. The velocity a>/k is 
known as the phase velocity and henceforth is designated by v p u, so that 

to 

V P U = k (1.4) 


Composite wave 

A composite wave is obtained by the addition or superposition of any number 
of plane waves 

n 

>F(x, t) = J2 .4/e iU/V (1.5) 

7=1 

where Aj are constants. Equation (1.5) is a Fourier series representation of 
W(x, t ). Fourier series are discussed in Appendix B. The composite wave 
^P(x, t ) is not a moving harmonic wave, but rather a superposition of n plane 
waves with different wavelengths and frequencies and with different ampli¬ 
tudes Aj. Each plane wave travels with its own phase velocity t> p h j, such that 

O); 

V * J = T 

As a consequence, the profile of this composite wave changes with time. The 
wave numbers kj may be positive or negative, but we will restrict the angular 
frequencies 0 )j to positive values. A plane wave with a negative value of k has 
a negative value for its phase velocity and corresponds to a harmonic wave 
moving in the negative x-direction. In general, the angular frequency a> 
depends on the wave number k. The dependence of o>( k) is known as the law 
of dispersion for the composite wave. 

In the special case where the ratio a>(k)/k is the same for each of the 
component plane waves, so that 

(0 1 <D 2 CD n 

k\ k 2 k n 

then each plane wave moves with the same velocity. Thus, the profile of the 
composite wave does not change with time even though the angular frequencies 
and the wave numbers differ. For this undispersed wave motion, the angular 
frequency oj(k) is proportional to | k\ 

a>(k) = c\k\ (1.6) 

where c is a constant and, according to equation (1.4), is the phase velocity of 
each plane wave in the composite wave. Examples of undispersed wave motion 
are a beam of light of mixed frequencies traveling in a vacuum and the 
undamped vibrations of a stretched string. 
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For dispersive wave motion, the angular frequency co(k ) is not proportional 
to \k\, so that the phase velocity u p h varies from one component plane wave to 
another. Since the phase velocity in this situation depends on k, the shape of 
the composite wave changes with time. An example of dispersive wave motion 
is a beam of light of mixed frequencies traveling in a dense medium such as 
glass. Because the phase velocity of each monochromatic plane wave depends 
on its wavelength, the beam of light is dispersed, or separated onto its 
component waves, when passed through a glass prism. The wave on the surface 
of water caused by dropping a stone into the water is another example of 
dispersive wave motion. 


Addition of two plane waves 

As a specific and yet simple example of composite-wave construction and 
behavior, we now consider in detail the properties of the composite wave 
x ¥(x, t) obtained by the addition or superposition of the two plane waves 
exp[i(£ix — at 1 0] and exp[i(A: 2 X — o> 2 /)] 

W(x, t) = e i( * lJC_<Ul t] + e' (k2X ~ W2t) (1.7) 

We define the average values k and (id and the differences A k and Ao> for the 
two plane waves in equation (1.7) by the relations 


j _ k\ + 


a>i + a>2 

co = -- 


A k = k\ — k 2 A co = co\ — (02 


so that 


k i — k -\- 


A k 

~ 2 ’ 

_ A co 

0)1 = CO+-, 


kj = k — 


A k 
A co 


(O 2 = co — 


Using equation (A.32) from Appendix A, we may now write equation (1.7) in 
the form 


W(x, t) = e 1 


_ p i(&x— o)t)^i(Akx—Acot)/2 _j_ ^—[(Akx—Acot)/! 


+ e~ 


-] 


= 2 cos 


(Akx-Acot\ K j x _ m) 


V 


7 


( 1 . 8 ) 


Equation (1.8) represents a plane wave exp[i(/cx — Tot)] with wave number k, 
angular frequency To, and phase velocity To/k, but with its amplitude modulated 
by the function 2 cos[(A£x — Acot)/2\. The real part of the wave (1.8) at some 
fixed time t 0 is shown in Figure 1 .2(a). The solid curve is the plane wave with 
wavelength X = 2n/k and the dashed curve shows the profile of the amplitude 
of the plane wave. The profile is also a harmonic wave with wavelength 
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2jz 




Figure 1.2 (a) The real part of the superposition of two plane waves is shown by the 
solid curve. The profile of the amplitude is shown by the dashed curve. ( b ) The 
positions of the curves in Figure 1.2(a) after a short time interval. 


4ji/Ak. At the points of maximum amplitude, the two original plane waves 
interfere constructively. At the nodes in Figure 1.2(a), the two original plane 
waves interfere destructively and cancel each other out. 

As time increases, the plane wave cxp[i(Ax — cot)] moves with velocity Ft/ k. 
If we consider a fixed point x\ and watch the plane wave as it passes that point, 
we observe not only the periodic rise and fall of the amplitude of the 
unmodified plane wave exp[i(A:x — Ft/)], but also the overlapping rise and fall 
of the amplitude due to the modulating function 2 cos [(A Ax — Am/)/2]. With¬ 
out the modulating function, the plane wave would reach the same maximum 
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and the same minimum amplitude with the passage of each cycle. The 
modulating function causes the maximum (or minimum) amplitude for each 
cycle of the plane wave to oscillate with frequency Ato/2. 

The pattern in Figure 1.2(a) propagates along the x-axis as time progresses. 
After a short period of time At, the wave (1.8) moves to a position shown in 
Figure 1.2(6). Thus, the position of maximum amplitude has moved in the 
positive x-direction by an amount v g A t, where v g is the group velocity of the 
composite wave, and is given by 




A to 
A k 


(1.9) 


The expression (1.9) for the group velocity of a composite of two plane waves 
is exact. 

In the special case when k 2 equals — k\ and a> 2 equals ao\ in equation (1.7), 
the superposition of the two plane waves becomes 

W(x, t) = e (kx - mt) + e~ i(kx+0Jt) ( 1 . 10 ) 


where 


k = k\ = — k 2 
a> = a>i = co 2 


The two component plane waves in equation (1.10) travel with equal phase 
velocities co/k, but in opposite directions. Using equations (A.31) and (A.32), 
we can express equation (1.10) in the form 


x V(x, t) 


(e ikx + e~ lla )e 


—iAx\_—i cot 


2 cos Axe lwt 


= 2 cos Ax (cos co t — i sin co t) 


We see that for this special case the composite wave is the product of two 
functions: one only of the distance x and the other only of the time t. The 
composite wave fifix, t) vanishes whenever cos Ax is zero, i.e., when Ax = ti/2, 
2>ji/2, 5jt./2, ..., regardless of the value of t. Therefore, the nodes of W(x, t) 
are independent of time. However, the amplitude or profile of the composite 
wave changes with time. The real part of l I ; (x, t) is shown in Figure 1.3. The 
solid curve represents the wave when cos cot is a maximum, the dotted curve 
when cos cot is a minimum, and the dashed curve when cos cot has an 
intermediate value. Thus, the wave does not travel, but pulsates, increasing and 
decreasing in amplitude with frequency co. The imaginary part of 'tfix, t) 
behaves in the same way. A composite wave with this behavior is known as a 
standing wave. 
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Re V(x:, t) 


x 


Figure 1.3 A standing harmonic wave at various times. 



1.2 Wave packet 

We now consider the formation of a composite wave as the superposition of a 
continuous spectrum of plane waves with wave numbers k confined to a narrow 
band of values. Such a composite wave v F(x, t ) is known as a wave packet and 
may be expressed as 

1 f°° 

'I'(x, t) = —= A(k)e {kx - (Ut) dk (1.11) 

V^J-oo 

The weighting factor A(k ) for each plane wave of wave number k is negligible 
except when k lies within a small interval A k. For mathematical convenience 
we have included a factor (2n) 1/2 on the right-hand side of equation (1.11). 
This factor merely changes the value of A ( k ) and has no other effect. 

We note that the wave packet Tfix, /) is the inverse Fourier transform of 
A(k). The mathematical development and properties of Fourier transforms are 
presented in Appendix B. Equation (1.11) has the form of equation (B. 19). 
According to equation (B.20), the Fourier transform A(k) is related to l F(x, /) 
by 

1 pOO 

A(k) = —= W(x, t)Q-' (kx -° Jt) dx (1.12) 

v — ex) 

It is because of the Fourier relationships between ^(x, t) and A(k) that the 
factor (2is included in equation (1.11). Although the time t appears in 
the integral on the right-hand side of (1.12), the function A(k) does not depend 
on t; the time dependence of ’F(x, l) cancels the factor e u "‘. We consider below 
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two specific examples for the functional form of A(k). However, in order to 
evaluate the integral over k in equation (1.11), we also need to know the 
dependence of the angular frequency o> on the wave number k. 

In general, the angular frequency «>( k) is a function of k, so that the angular 
frequencies in the composite wave W(x, t), as well as the wave numbers, vary 
from one plane wave to another. If a>(k) is a slowly varying function of k and 
the values of k are confined to a small range A k, then o>(k) may be expanded 
in a Taylor series in k about some point ko within the interval Ak 

a)(k) = a>o + — *o) + 2 ~~ k°) 2 + ■ • • (1-13) 

where &> 0 is the value of o>{k) at /c 0 and the derivatives are also evaluated at k (} . 
We may neglect the quadratic and higher-order terms in the Taylor expansion 
(1.13) because the interval A k and, consequently, k — k 0 are small. Substitu¬ 
tion of equation (1.13 ) into the phase for each plane wave in (1.11) then gives 

kx — ti)t ~ (k — ko + ko)x — a>ot — ( — ) (k — ko)t 

VdA'/o 


= koX — (Do t + 


x — 



(k - ko) 


so that equation (1.11) becomes 

W(x, t ) = B(x, t)e' (koX - Wo 0 (1.14) 

where 


B(x, t) 



A(k)Q l fi-( dco / d A) o d(A-i°) ^ 


(1.15) 


Thus, the wave packet W(x, /) represents a plane wave of wave number k 0 and 
angular frequency co o with its amplitude modulated by the factor B(x, t ). This 
modulating function B(x, t) depends on x and t through the relationship 
[x — {&a) / &k)ot]. This situation is analogous to the case of two plane waves as 
expressed in equations (1.7) and (1.8). The modulating function B(x, t) moves 
in the positive x-direction with group velocity u g given by 


v s = 



(1.16) 


In contrast to the group velocity for the two-wave case, as expressed in 
equation (1.9), the group velocity in (1.16) for the wave packet is not uniquely 
defined. The point k 0 is chosen arbitrarily and, therefore, the value at k 0 of the 
derivative dco/dk varies according to that choice. However, the range of k is 
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narrow and a>(k) changes slowly with k, so that the variation in v g is small. 
Combining equations (1.15) and (1.16), we have 

1 pOG 

B(x, t ) = -= A(k)e lix -^ ,)(k - ko) d k (1.17) 

v «. —oo 

Since the function A(k) is the Fourier transform of x V(x, t), the two functions 
obey Parseval’s theorem as given by equation (B.28) in Appendix B 

*oo poo poo 

\W(x,t)\ 2 dx= \B{x, t)| 2 dx = \A{k)\ 2 &k (1.18) 

J—oo J—OO J— oo 

Gaussian wave number distribution 

In order to obtain a specific mathematical expression for the wave packet, we 
need to select some form for the function A(k). In our first example we choose 
A(k) to be the gaussian function 

A(k) = _^ e - ( *- Ao)2/2a2 (1.19) 

\j2na 

This function A(k) is a maximum at wave number k 0 , which is also the average 
value for k for this distribution of wave numbers. Substitution of equation 
(1.19) into (1.17) gives 

|rP(x, 01 = B(x, 0 = -^ e - a2<x - v Af/2 (1.20) 

s/2n 

where equation (A. 8) has been used. The resulting modulating factor B(x, t ) is 
also a gaussian function-following the general result that the Fourier transform 
of a gaussian function is itself gaussian. We have also noted in equation (1.20) 
that B(x, t) is always positive and is therefore equal to the absolute value 
|Tfix, 01 of the wave packet. The functions A(k) and |’F(x, 0| are shown in 
Figure 1.4. 



Figure 1.4 (a) A gaussian wave number distribution. ( b ) The modulating function 
corresponding to the wave number distribution in Figure 1.4(a). 
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Figure 1.5 shows the real part of the plane wave exp[i(/c 0 x — o>o0] with its 
amplitude modulated by B(x, t ) of equation (1.20). The plane wave moves in 
the positive x-direction with phase velocity v v ^ equal to o> {) / k {] . The maximum 
amplitude occurs at x = v g t and propagates in the positive x-direction with 
group velocity v g equal to (dm/d k) 0 . 

The value of the function A(k) falls from its maximum value of (\/2jra) _1 at 
A'o to 1/e of its maximum value when \k — A'o| equals \fla. Most of the area 
under the curve (actually 84.3%) comes from the range 

— Via < (k — kf)< Via 

Thus, the distance Via may be regarded as a measure of the width of the 
distribution A(k) and is called the half width. The half width may be defined 
using 1/2 or some other fraction instead of 1/e. The reason for using 1/e is 
that the value of k at that point is easily obtained without consulting a table of 
numerical values. These various possible definitions give different numerical 
values for the half width, but all these values are of the same order of 
magnitude. Since the value of |W(x, /)| falls from its maximum value of 
(2jr) —1//2 to 1/e of that value when |x — v g t\ equals V^/a , the distance V2/a 
may be considered the half width of the wave packet. 

When the parameter a is small, the maximum of the function A(k) is high 
and the function drops off in value rapidly on each side of A' 0 , giving a small 
value for the half width. The half width of the wave packet, however, is large 
because it is proportional to 1/a. On the other hand, when the parameter a is 
large, the maximum of A(k) is low and the function drops off slowly, giving a 
large half width. In this case, the half width of the wave packet becomes small. 

If we regard the uncertainty A A; in the value of k as the half width of the 
distribution A(k) and the uncertainty Ax in the position of the wave packet as 
its half width, then the product of these two uncertainties is 

Ax A A' = 2 



Figure 1.5 The real part of a wave packet for a gaussian wave number distribution. 
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Thus, the product of these two uncertainties Ax and A A' is a constant of order 
unity, independent of the parameter a. 


Square pulse wave number distribution 

As a second example, we choose A(k) to have a constant value 
between k\ and k 2 and to vanish elsewhere, so that 
A(k) =1, k\ *£ k k 2 
= 0, k < k\, k > k 2 


of unity for k 


( 1 . 21 ) 


as illustrated in Figure 1.6(a). With this choice for A(k), the modulating 
function B(x, t ) in equation (1.17) becomes 
1 f* 2 

B(x, t) = —= Q kx-v & t)(k-k») dk 

V^rJ/c, 


_j- e i(x-y g O(* 2 -*o) _ gk x ~ u g /)(i'i — ^o) j 

x/2jri(x — v g t) 


___|-gi(x— v g t)Ak/2 _ i(x— v g t)Ak/2-^ 

v / 2jri(x — v g t) 


2 sin[(x — v g t)Ak/2] 

Jt X — Vnt 


( 1 . 22 ) 


where k 0 is chosen to be (k[ + k 2 )/2, Ak is defined as (k 2 — k\), and equation 
(A.33) has been used. The function B(x, t) is shown in Figure 1.6(b). 

The real part of the wave packet ^(x, t) obtained from combining equations 
(1.14) and (1.22) is shown in Figure 1.7. The amplitude of the plane wave 
exp[i(A:ox — o»o0] is modulated by the function B(x, t) of equation (1.22), 
which has a maximum when (x — v g t) equals zero, i.e., when x = v g t. The 
nodes of B(x, t) nearest to the maximum occur when (x — v g t)Ak/2 equals 
±ji, i.e., when x is ±(2rr/A/c) from the point of maximum amplitude. If we 
consider the half width of the wave packet between these two nodes as a 
measure of the uncertainty Ax in the location of the wave packet and the width 
( k 2 — k\) of the square pulse A(k) as a measure of the uncertainty Ak in the 
value of k, then the product of these two uncertainties is 

Ax A A' = 2jt. 


Uncertainty relation 

We have shown in the two examples above that the uncertainty Ax in the 
position of a wave packet is inversely related to the uncertainty Ak in the wave 
numbers of the constituent plane waves. This relationship is generally valid and 
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Figure 1.6 (a) A square pulse wave number distribution, (b) The modulating function 
corresponding to the wave number distribution in Figure 1.6(a). 



Figure 1.7 The real part of a wave packet for a square pulse wave number distribution. 
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is a property of Fourier transforms. In order to localize a wave packet so that 
the uncertainty Ax is very small, it is necessary to employ a broad spectrum of 
plane waves in equations (1.11) or (1.17). The function A(k) must have a wide 
distribution of wave numbers, giving a large uncertainty A A:. If the distribution 
A(k) is very narrow, so that the uncertainty A k is small, then the wave packet 
becomes broad and the uncertainty Ax is large. 

Thus, for all wave packets the product of the two uncertainties has a lower 
bound of order unity 

AxAk 5* 1 (1.23) 


The lower bound applies when the narrowest possible range A A: of values for k 
is used in the construction of the wave packet, so that the quadratic and higher- 
order terms in equation (1.13) can be neglected. If a broader range of k is 
allowed, then the product AxAk can be made arbitrarily large, making the 
right-hand side of equation (1.23) a lower bound. The actual value of the lower 
bound depends on how the uncertainties are defined. Equation (1.23) is known 
as the uncertainty relation. 

A similar uncertainty relation applies to the variables t and a). To show this 
relation, we write the wave packet (1.11) in the form of equation (B.21) 

1 f°° 

W(x,t) = -= G((o)e 1(kx - at) dm (1.24) 

V «. — OO 


where the weighting factor G(o>) has the form of equation (B.22) 


G(oj) 



W(x, t)&-' ikx - wt) d t 


In the evaluation of the integral in equation (1.24), the wave number k is 
regarded as a function of the angular frequency to, so that in place of (1.13) we 
have 


k(o)) — ko + J (oj — a) o) + • • • 

If we neglect the quadratic and higher-order terms in this expansion, then 
equation (1.24) becomes 

W(x, t) = C(x, t)e' (koX -° )nt) 

where 


1 f°° 

C(x, t ) = -= ^Q-iU-Ak/dcoh^co-coo) d(jJ 
V^J-cc 

As before, the wave packet is a plane wave of wave number k 0 and angular 
frequency co 0 with its amplitude modulated by a factor that moves in the 
positive x-direction with group velocity v g , given by equation (1.16). Following 
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the previous analysis, if we select a specific form for the modulating function 
G{(d) such as a gaussian or a square pulse distribution, we can show that the 
product of the uncertainty At in the time variable and the uncertainty A to in 
the angular frequency of the wave packet has a lower bound of order unity, i.e. 

AtAco 3= 1 (1.25) 

This uncertainty relation is also a property of Fourier transforms and is valid 
for all wave packets. 


1.3 Dispersion of a wave packet 


In this section we investigate the change in contour of a wave packet as it 
propagates with time. 

The general expression for a wave packet x ¥(x, t ) is given by equation 
(1.11). The weighting factor A(k) in (1.11) is the inverse Fourier transform of 
x ¥(x, t) and is given by (1.12). Since the function A(k) is independent of time, 
we may set t equal to any arbitrary value in the integral on the right-hand side 
of equation (1.12). If we let t equal zero in (1.12), then that equation becomes 


A{k) 


1 

a/2tt . 


0)e“ i ^ 

— OO 


d£ 


(1.26) 


where we have also replaced the dummy variable of integration by g. Substitu¬ 
tion of equation (1.26) into (1.11) yields 


^(x, t) 


1 

2xc. 


x v(Z, oy [k{x -®- w, Uk&$ 


—OO 


(1.27) 


Since the limits of integration do not depend on the variables £ and k, the order 
of integration over these variables may be interchanged. 

Equation (1.27) relates the wave packet x ¥(x, /) at time t to the wave packet 
x ¥(x, 0) at time / = 0. However, the angular frequency o>(k) is dependent on k 
and the functional form must be known before we can evaluate the integral 
over k. 

If <o{k ) is proportional to \ k\ as expressed in equation (1.6), then (1.27) gives 


fifix, /) 


1 

2n 


*P(£, o y^-ct-k) 


—OO 


dkd$ 


The integral over k may be expressed in terms of the Dirac delta function 
through equation (C.6) in Appendix C, so that we have 
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xp(x, t ) = tP(£, 0)c)(x -ct-%) d§ = W(x - ct, 0) 

J — 00 

Thus, the wave packet W(x, t ) has the same value at point x and time t that it 
had at point x — ct at time t = 0. The wave packet has traveled with velocity c 
without a change in its contour, i.e., it has traveled without dispersion. Since 
the phase velocity t> p h is given by o>o / A 0 = c and the group velocity v g is given 
by (dcy/d£)o = c, the two velocities are the same for an undispersed wave 
packet. 

We next consider the more general situation where the angular frequency 
oj(k) is not proportional to \k\, but is instead expanded in the Taylor series 
(1.13) about (k — k 0 ). Now, however, we retain the quadratic term, but still 
neglect the terms higher than quadratic, so that 

ti)(k) ~ to o + Vg{k — A'o) + y(k — A:o) 2 

where equation (1.16) has been substituted for the first-order derivative and y 
is an abbreviation for the second-order derivative 

7 2\<Sk 2 Jo 

The phase in equation (1.27) then becomes 

Ar(x — £) — cot = {k — kf)(x — §) + £o(x — §) — (Dot 

— Vg t(k — ko) — yt(k — A'o) 2 

= kox — (Dot — Aro§ + (x — Vgt — f)(k — kf) — yt(k — ko) 2 

so that the wave packet (1.27) takes the form 

0 ° 

e l(koX-0) 0 t) 1 * 1 * 

Wy(x, t) = -—- W(g, 0 ) e -iko^(x-v g t-m-ko)-iYt(k-ko) dk 

—oo 

The subscript y has been included in the notation v lf,(x, /) in order to 
distinguish that wave packet from the one in equations (1.14) and (1.15), where 
the quadratic term in co(k) is omitted. The integral over k may be evaluated 
using equation (A. 8), giving the result 

OO 

e i(k 0 x-a)ot) (*(* 

W y (x, 0 =-W(§, 0)e- lko h- (x - v *‘-® / 4 w d§ (1.28) 

2y/utyt J. 

— 00 

Equation (1.28) relates the wave packet at time t to the wave packet at time 
t = 0 if the A:-dependence of the angular frequency includes terms up to k 2 . 
The profile of the wave packet fif,(x, /) changes as time progresses because of 
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the factor f l/2 before the integral and the t in the exponent within the integral. 
If we select a specific form for the wave packet at time t = 0, the nature of this 
time dependence becomes more evident. 


Gaussian wave packet 

Let us suppose that W(x, 0) has the gaussian distribution (1.20) as its profile, so 
that equation (1.14) at time t = 0 is 


0) = 0) = —L= Q ik ^ e - a2 Z 2 / 2 

Substitution of equation (1.29) into (1.28) gives 


(1.29) 


x V/(x, t ) = 


J(k 0 x-aj 0 1 ) 


Q-a 2 ?/2 e -(x-v g t-£f /4iyt ^ 


7 ’ ’ 2n\J2xyt 

The integral may be evaluated using equation (A. 8) accompanied with some 
tedious, but straightforward algebraic manipulations, yielding 

e i(k 0 x-m 0 t) 


W y (x, t) = 


+ 2i a 2 yt) 


a ■ —a 2 (x— Vg t) 2 /2( 1 +2ia 2 y t) 


(1.30) 


The wave packet, then, consists of the plane wave expi[A' 0 x — co 0 t] with its 
amplitude modulated by 

_ 1 _ e -a 2 (x-v g tf/2(l+2ia 2 yt) 

+ 2i a 2 yt) 


which is a complex function that depends on the time t. When y equals zero so 
that the quadratic term in a>(k) is neglected, this complex modulating function 
reduces to B(x, t) in equation (1.20). The absolute value |Tf,(x, /)| of the wave 
packet (1.30) is given by 


\Wy(x, 01 


_ 1 _ —a 2 (x— u g /) 2 /2( 1 +4a 4 y 2 1 2 ) 

(2jr) 1 / 2 (l Ada 4 / 2 / 2 ) 1 / 4 


(1.31) 


We now contrast the behavior of the wave packet in equation (1.31) with that 
of the wave packet in (1.20). At any time t, the maximum amplitudes of both 
occur at x = v g t and travel in the positive x-direction with the same group 
velocity v g . However, at that time t, the value of |’T y (x, t)\ is 1/e of its 
maximum value when the exponent in equation (1.31) is unity, so that the half 
width or uncertainty Ax for | x T r y (x, t)\ is given by 

Ax = |x — v g t\ = — \J 1 + 4 a 4 y 2 t 2 

Moreover, the maximum amplitude for |TL(x, /)| at time l is given by 

(2jt)“ 1/2 (1 +4a 4 y 2 f 2 )“ 1/4 
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As time increases from — oo to 0, the half width of the wave packet | x F y (x, /)| 
continuously decreases and the maximum amplitude continuously increases. At 
t = 0 the half width attains its lowest value of y/l/a and the maximum 
amplitude attains its highest value of 1 / a/2tt, and both values are in agreement 
with the wave packet in equation (1.20). As time increases from 0 to oo, the 
half width continuously increases and the maximum amplitude continuously 
decreases. Thus, as t 2 increases, the wave packet [’fi-hx, f)| remains gaussian 
in shape, but broadens and flattens out in such a way that the area under the 
square j'P-Tx, t) | 2 of the wave packet remains constant over time at a value of 
(2\fjw,) in agreement with Parseval’s theorem (1.18). 

The product AxA k for this spreading wave packet y V y (x, t) is 

Ax A A' = 2sJ\ + 4a 4 y 2 ? 2 

and increases as 1 1\ increases. Thus, the value of the right-hand side when / = 0 
is the lower bound for the product AxA A; and is in agreement with the 
uncertainty relation (1.23). 


1.4 Particles and waves 


To explain the photoelectric effect, Einstein (1905) postulated that light, or 
electromagnetic radiation, consists of a beam of particles, each of which travels 
at the same velocity c (the speed of light), where c has the value 

c = 2.99792 X 10 8 ms” 1 

Each particle, later named a photon, has a characteristic frequency v and an 
energy hv, where h is Planck’s constant with the value 

h = 6.62608 X 10” 34 Js 


The constant h and the hypothesis that energy is quantized in integral multiples 
of hv had previously been introduced by M. Planck (1900) in his study of 
blackbody radiation. 1 In terms of the angular frequency at defined in equation 
(1.2), the energy E of a photon is 

E = hco (1.32) 


where h is defined by 

h = ^-= 1.05457 X 10” 34 Js 
2jt. 

Because the photon travels with velocity c, its motion is governed by relativity 


1 The history of the development of quantum concepts to explain observed physical phenomena, which 
occurred mainly in the first three decades of the twentieth century, is discussed in introductory texts on 
physical chemistry and on atomic physics. A much more detailed account is given in M. Jammer (1966) 
The Conceptual Development of Quantum Mechanics (McGraw-Hill, New York). 
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theory, which requires that its rest mass be zero. The magnitude of the 
momentum p for a particle with zero rest mass is related to the relativistic 
energy E by p = E/c, so that 

E hv hco 

P c c c 

Since the velocity c equals io/k , the momentum is related to the wave number 
k for a photon by 

p = hk (1.33) 


Einstein’s postulate was later confirmed experimentally by A. Compton (1924). 

Noting that it had been fruitful to regard light as having a corpuscular nature, 
L. de Broglie (1924) suggested that it might be useful to associate wave-like 
behavior with the motion of a particle. He postulated that a particle with linear 
momentum p be associated with a wave whose wavelength X is given by 


, 2n h 

1 = t=- p 


(1.34) 


and that expressions (1.32) and (1.33) also apply to particles. The hypothesis of 
wave properties for particles and the de Broglie relation (equation (1.34)) have 
been confirmed experimentally for electrons by G. P. Thomson (1927) and by 
Davisson and Germer (1927), for neutrons by E. Fermi and L. Marshall (1947), 
and by W. H. Zinn (1947), and for helium atoms and hydrogen molecules by I. 
Estermann, R. Frisch, and O. Stern (1931). 

The classical, non-relativistic energy E for a free particle, i.e., a particle in 
the absence of an external force, is expressed as the sum of the kinetic and 
potential energies and is given by 

1 7 p 2 

E = -mv 2 + V = — +V (1.35) 

2 2m 


where m is the mass and v the velocity of the particle, the linear momentum p 
is 


p = mv 

and V is a constant potential energy. The force F acting on the particle is given 
by 


and vanishes because V is constant. In classical mechanics the choice of the 
zero-level of the potential energy is arbitrary. Since the potential energy for the 
free particle is a constant, we may, without loss of generality, take that constant 
value to be zero, so that equation (1.35) becomes 
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p 2 

E = V~ 

2 m 


(1.36) 


Following the theoretical scheme of Schrodinger, we associate a wave packet 
V(x, t) with the motion in the x-direction of this free particle. This wave 
packet is readily constructed from equation (1.11) by substituting (1.32) and 
(1.33) for a) and k, respectively 

1 i'°° 

" ' ' (1.37) 


x V(x, t) = 




A(p)e' (px ~ Et) / fl dp 


where, for the sake of symmetry between v F(x, t) and A (p), a factor h ^ 1 /2 has 
been absorbed into A(p). The function A(k) in equation (1.12) is now 
h l / 2 A(p), so that 

1 POO 

A(p) = -= x V(x, t) e -'(P x - E, )/ h dx (1.38) 

\[2jtfl J—00 

The law of dispersion for this wave packet may be obtained by combining 
equations (1.32), (1.33), and (1.36) to give 

" 2 hk 2 

2 m 


P 


° >{k) h 2 mil 


(1.39) 


This dispersion law with a> proportional to k 2 is different from that for 
undispersed light waves, where a> is proportional to k. 

If (o{k ) in equation (1.39) is expressed as a power series in k — A' 0 , we obtain 


hkT, hkt) h 9 

a)(k) = - (k- k 0 ) + —(k~ k 0 ) 

2 m m 2 m 


(1.40) 


This expansion is exact; there are no terms of higher order than quadratic. 
From equation (1.40) we see that the phase velocity t> p h of the wave packet is 
given by 

m 0 


hk o 

ph A'o 2 m 


and the group velocity v„ is 


v„ = 


dm 

dk 


hkr ) 


m 


(1.41) 


(1.42) 


7 = 


(1.43) 


while the parameter y of equations (1.28), (1.30), and (1.31) is 

1 f d 2 m\ h 

2 \dA' 2 /o 2m 

If we take the derivative of co(k) in equation (1.39) with respect to k and use 
equation (1.33), we obtain 

dm hk p 

2TT = — = - =v 
d k m m 



7.5 Heisenberg uncertainty principle 


21 


Thus, the velocity v of the particle is associated with the group velocity v g of 
the wave packet 


v=v g 


If the constant potential energy Fin equation (1.35) is set at some arbitrary 
value other then zero, then equation (1.39) takes the form 

hk 2 V 
2 m n 


and the phase velocity becomes 

iik() V 
ph 2m hk 0 

Thus, both the angular frequency o>(k) and the phase velocity u p h are 
dependent on the choice of the zero-level of the potential energy and are 
therefore arbitrary; neither has a physical meaning for a wave packet represent¬ 
ing a particle. 

Since the parameter y is non-vanishing, the wave packet will disperse with 
time as indicated by equation (1.28). For a gaussian profile, the absolute value 
of the wave packet is given by equation (1.31) with y given by (1.43). We note 
that y is proportional to m ', so that as m becomes larger, y becomes smaller. 
Thus, for heavy particles the wave packet spreads slowly with time. 

As an example, the value of y for an electron, which has a mass of 
9.11 X 1CT 31 kg, is 5.78 X 10 5 m 2 s 1 . For a macroscopic particle whose 
mass is approximately a microgram, say 9.11 X 10” 10 kg in order to make the 
calculation easier, the value of y is 5.78 X 10” 26 m 2 s _1 . The ratio of the 
macroscopic particle to the electron is 10 21 . The time dependence in the 
dispersion terms in equations (1.31) occurs as the product yt. Thus, for the 
same extent of spreading, the macroscopic particle requires a factor of 10 21 
longer than the electron. 


1.5 Heisenberg uncertainty principle 

Since a free particle is represented by the wave packet x ¥(x, t ), we may regard 
the uncertainty Ax in the position of the wave packet as the uncertainty in the 
position of the particle. Likewise, the uncertainty A k in the wave number is 
related to the uncertainty Ap in the momentum of the particle by A A- = A p/h. 
The uncertainty relation (1.23) for the particle is, then 

AxAp^h (1.44) 

This relationship is known as the Heisenberg uncertainty principle. 

The consequence of this principle is that at any instant of time the position 
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of the particle is defined only as a range Ax and the momentum of the particle 
is defined only as a range A p. The product of these two ranges or ‘uncertain¬ 
ties’ is of order h or larger. The exact value of the lower bound is dependent on 
how the uncertainties are defined. A precise definition of the uncertainties in 
position and momentum is given in Sections 2.3 and 3.10. 

The Heisenberg uncertainty principle is a consequence of the stipulation that 
a quantum particle is a wave packet. The mathematical construction of a wave 
packet from plane waves of varying wave numbers dictates the relation (1.44). 
It is not the situation that while the position and the momentum of the particle 
are well-defined, they cannot be measured simultaneously to any desired degree 
of accuracy. The position and momentum are, in fact, not simultaneously 
precisely defined. The more precisely one is defined, the less precisely is the 
other, in accordance with equation (1.44). This situation is in contrast to 
classical-mechanical behavior, where both the position and the momentum can, 
in principle, be specified simultaneously as precisely as one wishes. 

In quantum mechanics, if the momentum of a particle is precisely specified 
so that p = po and Ap = 0, then the function A(p ) is 

A(p) = d(p - po) 


The wave packet (1.37) then becomes 

1 


W(x, t ) = 




dip - P o)Q l{px - El)ltl dp = 




J( pox—Et)/h 


which is a plane wave with wave number po/h and angular frequency E/h. 
Such a plane wave has an infinite value for the uncertainty Ax. Likewise, if the 
position of a particle is precisely specified, the uncertainty in its momentum is 
infinite. 

Another Heisenberg uncertainty relation exists for the energy E of a particle 
and the time t at which the particle has that value for the energy. The 
uncertainty A to in the angular frequency of the wave packet is related to the 
uncertainty A E in the energy of the particle by Am = A E/h, so that the 
relation (1.25) when applied to a free particle becomes 

AEAt 5= h (1.45) 


Again, this relation arises from the representation of a particle by a wave 
packet and is a property of Fourier transforms. 

The relation (1.45) may also be obtained from (1.44) as follows. The 
uncertainty A E is the spread of the kinetic energies in a wave packet. If A/? is 
small, then A E is related to Ap by 



P_ 

m 


Ap 


AE = A 


(1.46) 
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The time At for a wave packet to pass a given point equals the uncertainty in 
its position x divided by the group velocity v„ 


Ax Ax m . 

A t = — = — = —Ax 
v g V p 


(1-47) 


Combining equations (1.46) and (1.47), we see that AEAt = AxA p. Thus, the 
relation (1.45) follows from (1.44). The Heisenberg uncertainty relation (1.45) 
is treated more thoroughly in Section 3.10. 


1.6 Young’s double-slit experiment 

The essential features of the particle-wave duality are clearly illustrated by 
Young’s double-slit experiment. In order to explain all of the observations of 
this experiment, light must be regarded as having both wave-like and particle¬ 
like properties. Similar experiments on electrons indicate that they too possess 
both particle-like and wave-like characteristics. The consideration of the 
experimental results leads directly to a physical interpretation of Schrodinger’s 
wave function, which is presented in Section 1.8. 

The experimental apparatus is illustrated schematically in Figure 1.8. Mono¬ 
chromatic light emitted from the point source S is focused by a lens L onto a 
detection or observation screen D. Between L and D is an opaque screen with 
two closely spaced slits A and B, each of which may be independently opened 
or closed. 

A monochromatic light beam from S passing through the opaque screen with 
slit A open and slit B closed gives a diffraction pattern on D with an intensity 
distribution I A as shown in Figure 1.9(a). In that figure the points A and B are 
directly in line with slits A and B, respectively. If slit A is closed and slit B 
open, the intensity distribution of the diffraction pattern is given by the curve 
labeled In in Figure 1.9(a). For an experiment in which slit A is open and slit 
B is closed half of the time, while slit A is closed and slit B is open the other 
half of the time, the resulting intensity distribution is the sum of I A and In, as 
shown in Figure 1.9(b). However, when both slits are open throughout an 

D 



Figure 1.8 Diagram of Young’s double-slit experiment. 
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Figure 1.9 (a) Intensity distributions I a from slit A alone and /b from slit B alone, (h) 
The sum of the intensity distributions I a and Iq. (c) The intensity interference pattern 
when slits A and B are open simultaneously. 


experiment, an interference pattern as shown in Figure 1.9(c) is observed. The 
intensity pattern in this case is not the sum I a + Ib, but rather an alternating 
series of bright and dark interference fringes with a bright maximum midway 
between points A and B. The spacing of the fringes depends on the distance 
between the two slits. 

The wave theory for light provides a satisfactory explanation for these 
observations. It was, indeed, this very experiment conducted by T. Young 
(1802) that, in the nineteenth century, led to the replacement of Newton’s 
particle theory of light by a wave theory. 

The wave interpretation of the interference pattern observed in Young’s 
experiment is inconsistent with the particle or photon concept of light as 
required by Einstein’s explanation of the photoelectric effect. If the monochro¬ 
matic beam of light consists of a stream of individual photons, then each 
photon presumably must pass through either slit A or slit B. To test this 
assertion, detectors are placed directly behind slits A and B and both slits are 
opened. The light beam used is of such low intensity that only one photon at a 
time is emitted by S. In this situation each photon is recorded by either one 
detector or the other, never by both at once. Half of the photons are observed to 
pass through slit A, half through slit B in random order. This result is consistent 
with particle behavior. 

How then is a photon passing through only one slit influenced by the other 
slit to produce an interference pattern? A possible explanation is that somehow 
photons passing through slit A interact with other photons passing through slit 
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B and vice versa. To answer this question, Young’s experiment is repeated with 
both slits open and with only one photon at a time emitted by S. The elapsed 
time between each emission is long enough to rule out any interactions among 
the photons. While it might be expected that, under these circumstances, the 
pattern in Figure 1.9(6) would be obtained, in fact the interference fringes of 
Figure 1.9(c) are observed. Thus, the same result is obtained regardless of the 
intensity of the light beam, even in the limit of diminishing intensity. 

If the detection screen D is constructed so that the locations of individual 
photon impacts can be observed (with an array of scintillation counters, for 
example), then two features become apparent. The first is that only whole 
photons are detected; each photon strikes the screen D at only one location. 
The second is that the interference pattern is slowly built up as the cumulative 
effect of very many individual photon impacts. The behavior of any particular 
photon is unpredictable; it strikes the screen at a random location. The density 
of the impacts at each point on the screen D gives the interference fringes. 
Looking at it the other way around, the interference pattern is the probability 
distribution of the location of the photon impacts. 

If only slit A is open half of the time and only slit B the other half of the 
time, then the interference fringes are not observed and the diffraction pattern 
of Figure 1.9(6) is obtained. The photons passing through slit A one at a time 
form in a statistical manner the pattern labeled I a in Figure 1.9(a), while those 
passing through slit B yield the pattern / B . If both slits A and B are left open, 
but a detector is placed at slit A so that we know for certain whether each given 
photon passes through slit A or through slit B, then the interference pattern is 
again not observed; only the pattern of Figure 1.9(6) is obtained. The act of 
ascertaining through which slit the photon passes has the same effect as closing 
the other slit. 

The several variations on Young’s experiment cannot be explained exclu¬ 
sively by a wave concept of light nor by a particle concept. Both wave and 
particle behavior are needed for a complete description. When the photon is 
allowed to pass undetected through the slits, it displays wave behavior and an 
interference pattern is observed. Typical of particle behavior, each photon 
strikes the detection screen D at a specific location. However, the location is 
different for each photon and the resulting pattern for many photons is in 
accord with a probability distribution. When the photon is observed or 
constrained to pass through a specific slit, whether the other slit is open or 
closed, the behavior is more like that of a particle and the interference fringes 
are not observed. It should be noted, however, that the curve I a in Figure 
1.9(a) is the diffraction pattern for a wave passing through a slit of width 
comparable to the wavelength of the wave. Thus, even with only one slit open 
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and with the photons passing through the slit one at a time, wave behavior is 
observed. 

Analogous experiments using electrons instead of photons have been carried 
out with the same results. Electrons passing through a system with double slits 
produce an interference pattern. If a detector determines through which slit 
each electron passes, then the interference pattern is not observed. As with the 
photon, the electron exhibits both wave-like and particle-like behavior and its 
location on a detection screen is randomly determined by a probability 
distribution. 


1.7 Stern-Gerlach experiment 

Another experiment that relates to the physical interpretation of the wave 
function was performed by O. Stern and W. Gerlach (1922). Their experiment 
is a dramatic illustration of a quantum-mechanical effect which is in direct 
conflict with the concepts of classical theory. It was the first experiment of a 
non-optical nature to show quantum behavior directly. 

In the Stern-Gerlach experiment, a beam of silver atoms is produced by 
evaporating silver in a high-temperature oven and allowing the atoms to escape 
through a small hole. The beam is further collimated by passage through a 
series of slits. As shown in Figure 1.10, the beam of silver atoms then passes 
through a highly inhomogeneous magnetic field and condenses on a detection 
plate. The cross-section of the magnet is shown in Figure 1.11. One pole has a 
very sharp edge in order to produce a large gradient in the magnetic field. The 
atomic beam is directed along this edge (the z-axis) so that the silver atoms 
experience a gradient in magnetic field in the vertical or x-direction, but not in 
the horizontal or j’-direction. 

Silver atoms, being paramagnetic, have a magnetic moment M. In a 
magnetic field B, the potential energy Vof each atom is 

V = -MB 

Between the poles of the magnet, the magnetic field B varies rapidly in the x- 
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Figure 1.10 Diagram of the Stern-Gerlach experiment. 
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Figure 1.11 A cross-section of the magnet in Figure 1.10. 


direction, resulting in a force F x in the x-direction acting on each silver atom. 
This force is given by 


F x = - 


dV_ 

dx 


M cos 0 


dB 

dx 


where M and B are the magnitudes of the vectors M and B and 0 is the angle 
between the direction of the magnetic moment and the positive x-axis. Thus, 
the inhomogeneous magnetic field deflects the path of a silver atom by an 
amount dependent on the orientation angle 0 of its magnetic moment. If the 
angle 0 is between 0° and 90°, then the force is positive and the atom moves in 
the positive x-direction. For an angle 0 between 90° and 180°, the force is 
negative and the atom moves in the negative x-direction. 

As the silver atoms escape from the oven, their magnetic moments are 
randomly oriented so that all possible values of the angle 0 occur. According to 
classical mechanics, we should expect the beam of silver atoms to form, on the 
detection plate, a continuous vertical line, corresponding to a gaussian distribu¬ 
tion of impacts with a maximum intensity at the center (x = 0). The outer 
limits of this line would correspond to the magnetic moment of a silver atom 
parallel (0 = 0°) and antiparallel (0 = 180°) to the magnetic field gradient 
( dB/dx ). What is actually observed on the detection plate are two spots, 
located at each of the outer limits predicted by the classical theory. Thus, the 
beam of silver atoms splits into two distinct components, one corresponding to 
0 = 0°, the other to 0 = 180°. There are no trajectories corresponding to 
intermediate values of 0. There is nothing unique or special about the vertical 
direction. If the magnet is rotated so that the magnetic field gradient is along 
the y-axis, then again only two spots are observed on the detection plate, but 
are now located on the horizontal axis. 

The Stern-Gerlach experiment shows that the magnetic moment of each 
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silver atom is found only in one of two orientations, either parallel or 
antiparallel to the magnetic field gradient, even though the magnetic moments 
of the atoms are randomly oriented when they emerge from the oven. Thus, the 
possible orientations of the atomic magnetic moment are quantized, i.e., only 
certain discrete values are observed. Since the direction of the quantization is 
determined by the direction of the magnetic field gradient, the experimental 
process itself influences the result of the measurement. This feature occurs in 
other experiments as well and is characteristic of quantum behavior. 

If the beam of silver atoms is allowed to pass sequentially between the poles 
of two or three magnets, additional interesting phenomena are observed. We 
describe here three such related experimental arrangements. In the first 
arrangement the collimated beam passes through a magnetic field gradient 
pointing in the positive x-direction. One of the two exiting beams is blocked 
(say the one with antiparallel orientation), while the other (with parallel 
orientation) passes through a second magnetic field gradient which is parallel 
to the first. The atoms exiting the second magnet are deposited on a detection 
plate. In this case only one spot is observed, because the magnetic moments of 
the atoms entering the second magnetic field are all oriented parallel to the 
gradient and remain parallel until they strike the detection plate. 

The second arrangement is the same as the first except that the gradient of 
the second magnetic field is along the positive y-axis, i.e., it is perpendicular to 
the gradient of the first magnetic field. For this arrangement, two spots of silver 
atoms appear on the detection plate, one to the left and one to the right of the 
vertical x-axis. The beam leaving the first magnet with all the atomic magnetic 
moments oriented in the positive x-direction is now split into two equal beams 
with the magnetic moments oriented parallel and antiparallel to the second 
magnetic field gradient. 

The third arrangement adds yet another vertical inhomogeneous magnetic 
field to the setup of the second arrangement. In this new arrangement the 
collimated beam of silver atoms coming from the oven first encounters a 
magnetic field gradient in the positive x-direction, which splits the beam 
vertically into two parts. The lower beam is blocked and the upper beam passes 
through a magnetic field gradient in the positive y-direction. This beam is split 
horizontally into two parts. The left beam is blocked and the right beam is now 
directed through a magnetic field gradient parallel to the first one, i.e., oriented 
in the positive x-direction. The resulting pattern on the detection plate might be 
expected to be a single spot, corresponding to the magnetic moments of all 
atoms being aligned in the positive x-direction. What is observed in this case, 
however, are two spots situated on a vertical axis and corresponding to atomic 
magnetic moments aligned in equal numbers in both the positive and negative 
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x-directions. The passage of the atoms through the second magnet apparently 
realigned their magnetic moments parallel and antiparallel to the positive y- 
axis and thereby destroyed the previous information regarding their alignment 
by the first magnet. 

The original Stern-Gerlach experiment has also been carried out with the 
same results using sodium, potassium, copper, gold, thallium, and hydrogen 
atoms in place of silver atoms. Each of these atoms, including silver, has a 
single unpaired electron among the valence electrons surrounding its nucleus 
and core electrons. In hydrogen, of course, there is only one electron about the 
nucleus. The magnetic moment of such an atom is due to the intrinsic angular 
momentum, called spin, of this odd electron. The quantization of the magnetic 
moment by the inhomogeneous magnetic field is then the quantization of this 
electron spin angular momentum. The spin of the electron and of other 
particles is discussed in Chapter 7. 

Since the splitting of the atomic beam in the Stern-Gerlach experiment is 
due to the spin of an unpaired electron, one might wonder why a beam of 
electrons is not used directly rather than having the electrons attached to atoms. 
In order for a particle to pass between the poles of a magnet and be deflected 
by a distance proportional to the force acting on it, the trajectory of the particle 
must be essentially a classical path. As discussed in Section 1.4, such a particle 
is described by a wave packet and wave packets disperse with time-the lighter 
the particle, the faster the dispersion and the greater the uncertainty in the 
position of the particle. The application of Heisenberg’s uncertainty principle 
to an electron beam shows that, because of the small mass of the electron, it is 
meaningless to assign a magnetic moment to a free electron. As a result, the 
pattern on the detection plate from an electron beam would be sufficiently 
diffuse from interference effects that no conclusions could be drawn. 2 How¬ 
ever, when the electron is bound unpaired in an atom, then the atom, having a 
sufficiently larger mass, has a magnetic moment and an essentially classical 
path through the Stern-Gerlach apparatus. 


1.8 Physical interpretation of the wave function 

Young’s double-slit experiment and the Stern-Gerlach experiment, as de¬ 
scribed in the two previous sections, lead to a physical interpretation of the 
wave function associated with the motion of a particle. Basic to the concept of 
the wave function is the postulate that the wave function contains all the 


2 This point is discussed in more detail in N. F. Mott and H. S. W. Massey (1965) The Theory of Atomic 
Collisions, 3rd edition, p. 215 — 16, (Oxford University Press, Oxford). 



30 


The wave function 


information that can be known about the particle that it represents. The wave 
function is a complete description of the quantum behavior of the particle. For 
this reason, the wave function is often also called the state of the system. 

In the double-slit experiment, the patterns observed on the detection screen 
are slowly built up from many individual particle impacts, whether these 
particles are photons or electrons. The position of the impact of any single 
particle cannot be predicted; only the cumulative effect of many impacts is 
predetermined. Accordingly, a theoretical interpretation of the experiment must 
involve probability distributions rather than specific particle trajectories. The 
probability that a particle will strike the detection screen between some point x 
and a neighboring point x + dx is P(x) dx and is proportional to the range dx. 
The larger the range dx, the greater the probability for a given particle to strike 
the detection screen in that range. The proportionality factor P(x) is called the 
probability density and is a function of the position x. For example, the 
probability density P(x) for the curve I a in Figure 1.9(a) has a maximum at 
the point A and decreases symmetrically on each side of A. 

If the motion of a particle in the double-slit experiment is to be represented 
by a wave function, then that wave function must determine the probability 
density P(x). For mechanical waves in matter and for electromagnetic waves, 
the intensity of a wave is proportional to the square of its amplitude. By 
analogy, the probability density P(x) is postulated to be the square of the 
absolute value of the wave function 'Ifix) 

P(x) = |'I'(x)| 2 = T' ;i: (x)’I'(x) 

On the basis of this postulate, the interference pattern observed in the double¬ 
slit experiment can be explained in terms of quantum particle behavior. 

A particle, photon or electron, passing through slit A and striking the 
detection screen at point x has wave function Ta(x), while a similar particle 
passing through slit B has wave function 'Pb(x). Since a particle is observed to 
retain its identity and not divide into smaller units, its wave function l F(x) is 
postulated to be the sum of the two possibilities 

W(x) = W a (x) + ?b(x) (1.48) 

When only slit A is open, the particle emitted by the source S passes through 
slit A, thereby causing the wave function 'F(x) in equation (1.48) to change or 
collapse suddenly to T A (x). The probability density P A (x) that the particle 
strikes point x on the detection screen is, then 

Pa 0) = I^aCOI 2 

and the intensity distribution I a in Figure 1.9(a) is obtained. When only slit B 
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is open, the particle passes through slit B and the wave function W(x) collapses 
to WbCi). The probability density Pb(x) is then given by 

Pb(x) = |^b(x)| 2 

and curve 7 b in Figure 1.9(a) is observed. If slit A is open and slit B closed 
half of the time, and slit A is closed and slit B open the other half of the time, 
then the resulting probability density on the detection screen is just 

Pa(x) + P B (x) = \W A (x)\ 2 + |^b(x)| 2 
giving the curve in Figure 1.9 (b). 

When both slits A and B are open at the same time, the interpretation 
changes. In this case, the probability density P AB (x) is 

Pab(x) = I'Fa(x) + 'F B (x)| 2 

= |’Ta(x)| 2 + |'Fb(x-)| 2 + W*(x)W B (x) + W*(x)W A (x) 

= P A (x) + Pb(x) + :7ab(x) (1.49) 

where 

^ab(x) = W*(r)f B (x) + W*(x)W A (x) 

The probability density Pab(x) has an interference term ■9'ab(x) in addition to 
the terms P A (x) and Pb(x). This interference term is real and is positive for 
some values of x, but negative for others. Thus, the term :7ab(x) modifies the 
sum P A (x) + Pb (x) to give an intensity distribution with interference fringes as 
shown in Figure 1.9(c). 

For the experiment with both slits open and a detector placed at slit A, the 
interaction between the wave function and the detector must be taken into 
account. Any interaction between a particle and observing apparatus modifies 
the wave function of the particle. In this case, the wave function has the form 
of a wave packet which, according to equation (1.37), oscillates with time as 
e -i Et/ty During the time period At that the particle and the detector are 
interacting, the energy of the interacting system is uncertain by an amount A E, 
which, according to the Heisenberg energy-time uncertainty principle, equa¬ 
tion (1.45), is related to At by A E 2= h/At. Thus, there is an uncertainty in the 
phase Et/h of the wave function and fiVM is replaced by e'' in F A (x), where tp 
is real. The value of tp varies with each particle-detector interaction and is 
totally unpredictable. Therefore, the wave function W(x) for a particle in this 
experiment is 

W(x) = e'^ A (x) + W B (x) (1.50) 

and the resulting probability density P (p (x) is 
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P <p (x) = |V A (x)| 2 + |^ B (x)| 2 + e-'^*(x)W B W + e^*(« X ) 

= P\(x) + P B (x) + :7 (p (x) (1.51) 

where -7 <p (x) is defined by 

■7 (p (x) = bW + 

The interaction with the detector at slit A has changed the interference term 
from ./ab(x) to .7 (p (x). 

For any particular particle leaving the source S and ultimately striking the 
detection screen D, the value of <p is determined by the interaction with the 
detector at slit A. However, this value is not known and cannot be controlled; 
for all practical purposes it is a randomly determined and unverifiable number. 
The value of tp does, however, influence the point x where the particle strikes 
the detection screen. The pattern observed on the screen is the result of a large 
number of impacts of particles, each with wave function ^(x) in equation 
(1.50), but with random values for tp. In establishing this pattern, the term 
:7 (p (x) in equation (1.51) averages to zero. Thus, in this experiment the 
probability density P ip (x) is just the sum of P A (x) and P B (x), giving the 
intensity distribution shown in Figure 1.9(b). 

In comparing the two experiments with both slits open, we see that interact¬ 
ing with the system by placing a detector at slit A changes the wave function of 
the system and the experimental outcome. This feature is an essential char¬ 
acteristic of quantum theory. We also note that without a detector at slit A, 
there are two indistinguishable ways for the particle to reach the detection 
screen D and the two wave functions v F A (x) and 'PbU') are added together. 
With a detector at slit A, the two paths are distinguishable and it is the 
probability densities P A (x) and P B (x) that are added. 

An analysis of the Stern-Gerlach experiment also contributes to the 
interpretation of the wave function. When an atom escapes from the high- 
temperature oven, its magnetic moment is randomly oriented. Before this atom 
interacts with the magnetic field, its wave function 'B is the weighted sum of 
two possible states a and /3 

W = c a a + cpl3 (1.52) 

where c a and cy; are constants and are related by 

\ c a\“ + \Cfi\ 2 = 1 

In the presence of the inhomogeneous magnetic field, the wave function R* 
collapses to either a or (3 with probabilities |c a | 2 and | cy; | 2 , respectively. The 
state a corresponds to the atomic magnetic moment being parallel to the 
magnetic field gradient, the state (3 being antiparallel. Regardless of the 
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orientation of the magnetic field gradient, vertical (up or down), horizontal (left 
or right), or any angle in between, the wave function of the atom is always 
given by equation (1.52) with a parallel and /; antiparallel to the magnetic field 
gradient. Since the atomic magnetic moments are initially randomly oriented, 
half of the wave functions collapse to a and half to [i. 

In the Stern-Gerlach experiment with two magnets having parallel magnetic 
field gradients-the ‘first arrangement’ described in Section 1.7-all the atoms 
entering the second magnet are in state a and therefore are all deflected in the 
same direction by the second magnetic field gradient. Thus, it is clear that the 
wave function T ; before any interaction is permanently changed by the inter¬ 
action with the first magnet. 

In the ‘second arrangement’ of the Stern-Gerlach experiment, the atoms 
emerging from the first magnet and entering the second magnet are all in the 
same state, say a. (Recall that the other beam of atoms in state ft is blocked.) 
The wave function a may be regarded as the weighted sum of two states a' 
and /?' 

a = c' a a' + c'pf)' 

where a' and /?' refer to states with atomic magnetic moments parallel and 
antiparallel, respectively, to the second magnetic field gradient and where c' a 
and c’i are constants related by 

| Ca\~ + \ c 'p\~ = 1 

In the ‘second arrangement’, the second magnetic field gradient is perpendicu¬ 
lar to the first, so that 

\c' a \ 2 = \c' p \ 2 = \ 

and 

a = -J=(a'±/T) 

The interaction of the atoms in state a with the second magnet collapses the 
wave function a to either a' or /?' with equal probabilities. 

In the ‘third arrangement’, the right beam of atoms emerging from the 
second magnet (all atoms being in state a'), passes through a third magnetic 
field gradient parallel to the first. In this case, the wave function a' may be 
expressed as the sum of states a and /3 

a' =-^=(a±/3) 

The interaction between the third magnetic field gradient and each atom 
collapses the wave function a' to either a or /3 with equal probabilities. 

The interpretation of the various arrangements in the Stern-Gerlach experi- 
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ment reinforces the postulate that the wave function for a particle is the sum of 
indistinguishable paths and is modified when the paths become distinguishable 
by means of a measurement. The nature of the modification is the collapse of 
the wave function to one of its components in the sum. Moreover, this new 
collapsed wave function may be expressed as the sum of subsequent indis¬ 
tinguishable paths, but remains unchanged if no further interactions with 
measuring devices occur. 

This statistical interpretation of the significance of the wave function was 
postulated by M. Born (1926), although his ideas were based on some 
experiments other than the double-slit and Stern-Gerlach experiments. The 
concepts that the wave function contains all the information known about the 
system it represents and that it collapses to a different state in an experimental 
observation were originated by W. Heisenberg (1927). These postulates regard¬ 
ing the meaning of the wave function are part of what has become known as 
the Copenhagen interpretation of quantum mechanics. While the Copenhagen 
interpretation is disputed by some scientists and philosophers, it is accepted by 
the majority of scientists and it provides a consistent theory which agrees with 
all experimental observations to date. We adopt the Copenhagen interpretation 
of quantum mechanics in this book. 3 


Problems 

1.1 The law of dispersion for surface waves on a sheet of water of uniform depth d is 4 

cj(k) — (gf'tanh dk) ] ' 2 

where g is the acceleration due to gravity. What is the group velocity of the 
resultant composite wave? What is the limit for deep water (dk 5= 4)? 

1.2 The phase velocity for a particular wave is u ph — A/X, where A is a constant. What 
is the dispersion relation? What is the group velocity? 

1.3 Show that 

•OO 

A{k)dk = 1 

— OO 

for the gaussian function A(k) in equation (1.19). 


3 The historical and philosophical aspects of the Copenhagen interpretation are more extensively discussed 
in J. Baggott (1992) The Meaning of Quantum Theory (Oxford University Press, Oxford). 

4 For a derivation, see H. Lamb (1932) Hydrodynamics, pp. 363-81 (Cambridge University Press, Cam¬ 
bridge). 
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1.4 Show that the average value of k is k 0 for the gaussian function A(k) in equation 
(1.19). 

1.5 Show that the gaussian functions A(k) and x V(x, t ) obey Parseval’s theorem (1.18). 

1.6 Show that the square pulse A(k ) in equation (1.21) and the corresponding function 
x V(x, t ) obey Parseval’s theorem. 
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Schrodinger wave mechanics 


2.1 The Schrodinger equation 


In the previous chapter we introduced the wave function to represent the 
motion of a particle moving in the absence of an external force. In this chapter 
we extend the concept of a wave function to make it apply to a particle acted 
upon by a non-vanishing force, i.e., a particle moving under the influence of a 
potential which depends on position. The force F acting on the particle is 
related to the potential or potential energy V (x) by 




( 2 . 1 ) 


As in Chapter 1, we initially consider only motion in the x-direction. In Section 
2.7, however, we extend the formalism to include three-dimensional motion. 

In Chapter 1 we associated the wave packet 


fl'U, t) = 


1 


A(p)^ px ~ Et)/h dp 


( 2 . 2 ) 


\J 2jth. 

with the motion in the x-direction of a free particle, where the weighting factor 
A(p) is given by 


1 f°° 

A(p) = —= flflx, t)e-' (px - El)/fl dx (2.3) 

s/2jtfi J—oo 

This wave packet satisfies a partial differential equation, which will be used as 
the basis for the further development of a quantum theory. To find this 
differential equation, we first differentiate equation (2.2) twice with respect to 
the distance variable x to obtain 


d 2 xy 


-1 


p 2 A(p)Q' (px - El)/h dp 


dx 2 ^2jih 5 . _ 

Differentiation of (2.2) with respect to the time t gives 


(2.4) 
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dW 

dt 
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EA(p)e' (px - E,)/fl dp 


—l 


(2.5) 


V2jrh 3 , 

The total energy E for a free particle (i.e., for a particle moving in a region of 
constant potential energy V) is given by 

P 1 

E = ^+V 
2 m 

which may be combined with equations (2.4) and (2.5) to give 

vw 


dW _ h 2 d 2 W 
dt 2m dx 2 


Schrodinger (1926) postulated that this differential equation is also valid 
when the potential energy is not constant, but is a function of position. In that 
case the partial differential equation becomes 
d x ¥(x, t ) fr d 2x ¥(x, t) 


i h- 


+ V(x)W(x, t) 


( 2 . 6 ) 


dt 2m dx 2 

which is kn own as the time-dependent Schrodinger equation. The solutions 
Tfix, t) of equation (2.6) are the time-dependent wave functions. An important 
goal in wave mechanics is solving equation (2.6) for x ¥(x, t ) using various 
expressions for V(x) that relate to specific physical systems. 

When V(x) is not constant, the solutions '(fix, t) to equation (2.6) may still 
be expanded in the form of a wave packet, 

1 . ( 2 . 7 ) 


b'U, 0 = 


\j2xcfi. 


A(p, t)^ px ~ Et)/fl dp 


The Fourier transform A(p, t) is then, in general, a function of both p and time 
t, and is given by 

1 f°° 

A(p, t ) = -== W(x, t)Q- {{px ~ Et) / h dv (2.8) 

\Z2jth J—oo 

By way of contrast, recall that in treating the free particle as a wave packet in 
Chapter 1, we required that the weighting factor A(p) be independent of time 
and we needed to specify a functional form for A(p) in order to study some of 
the properties of the wave packet. 


2.2 The wave function 

Interpretation 

Before discussing the methods for solving the Schrodinger equation for specific 
choices of F(x), we consider the meaning of the wave function. Since the wave 
function l) is identified with a particle, we need to establish the connec¬ 
tion between v P(x, t) and the observable properties of the particle. As in the 
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case of the free particle discussed in Chapter 1, we follow the formulation of 
Born (1926). 

The fundamental postulate relating the wave function x V(x, t ) to the proper¬ 
ties of the associated particle is that the quantity |W(jc s Of = 'F'fx, /)’I ; (x, t) 
gives the probability density for finding the particle at point x at time t. Thus, 
the probability of finding the particle between x and x + dx at time t is 
|W(x, t) | 2 dx. The location of a particle, at least within an arbitrarily small 
interval, can be determined through a physical measurement. If a series of 
measurements are made on a number of particles, each of which has the exact 
same wave function, then these particles will be found in many different 
locations. Thus, the wave function does not indicate the actual location at 
which the particle will be found, but rather provides the probability for finding 
the particle in any given interval. More generally, quantum theory provides the 
probabilities for the various possible results of an observation rather than a 
precise prediction of the result. This feature of quantum theory is in sharp 
contrast to the predictive character of classical mechanics. 

According to Born’s statistical interpretation, the wave function completely 
describes the physical system it represents. There is no information about the 
system that is not contained in Tfix, /). Thus, the state of the system is 
determined by its wave function. For this reason the wave function is also 
called the state function and is sometimes referred to as the state Tfix, t). 

The product of a function and its complex conjugate is always real and is 
positive everywhere. Accordingly, the wave function itself may be a real or a 
complex function. At any point x or at any time t, the wave function may be 
positive or negative. In order that | l F(.v, t )| 2 represents a unique probability 
density for every point in space and at all times, the wave function must be 
continuous, single-valued, and finite. Since x V(x, t) satisfies a differential 
equation that is second-order in x, its first derivative is also continuous. The 
wave function may be multiplied by a phase factor e lct , where a is real, without 
changing its physical significance since 

[e ia Tfix, /)f [c'^Ifix, 0] = 'fi*(x, t)W(x, t) = |W(x, 0| 2 


Normalization 

The particle that is represented by the wave function must be found with 
probability equal to unity somewhere in the range — oo x =£ oo, so that 
W(x, 0 must obey the relation 

•OO 

|Tfix, 0| 2 dx = 1 


— OO 


(2.9) 
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A function that obeys this equation is said to be normalized. If a function 
0(x, I) is not normalized, but satisfies the relation 

•OO 

0*(x, /)0(x, t)dx = N 

J—OO 

then the function ^(x, t ) defined by 

't'br, 0 = 0 


is normalized. 

In order for fifix, /) to satisfy equation (2.9), the wave function must be 
square-integrable (also called quadratically integrable). Therefore, v P(x, t) 
must go to zero faster than 1 / -^/jxj as x approaches (±) infinity. Likewise, the 
derivative cFF /Ox must also go to zero as x approaches (±) infinity. 

Once a wave function x V(x, t) has been normalized, it remains normalized as 
time progresses. To prove this assertion, we consider the integral 


N = 


*oo 

— OO 


and show that N is independent of time for every function l T that obeys the 
Schrodinger equation (2.6). The time derivative of N is 

•OO Q 

-|^(x, t )| 2 dx (2.10) 

— OO 



where the order of differentiation and integration has been interchanged on the 
right-hand side. The derivative of the probability density may be expanded as 
follows 


r) rf /iUJ 

-m x , t )\ 2 = -(w*w) = w* — + w 


o 


o x v 


. o y v 2 


Of 


dt 


dt dt 

Equation (2.6) and its complex conjugate may be written in the form 

dfi' i h d 2 W i yX y 


dt 2m dx 2 h 
D y V* i h 0 2y V* 


dt 


so that d|*F(x, t)\ 2 /dt becomes 


2 m dx 2 h 






dx 2 


where the terms containing V cancel. We next note that 
d 


V ... c) y V 


ur*_ur 

dx V dx dx 


dx 2 dx 2 


( 2 . 11 ) 
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so that 


d 


if? d 




chi' 


dW 

dx 


( 2 . 12 ) 


f°° d 

f, r * d x V 

d xp*\ 

if? 

r r * aw 

«qr*] 

— 

XI?*- 


1 dx = — 

\p* 


3x 

—oo 

dx 

dx j 

2 m 

dx 

dx 


dt 1 v ’ 71 2m dx 

Substitution of equation (2.12) into (2.10) and evaluation of the integral give 

d N _ if? 
d/ 2 m 

Since x l*(x, t ) goes to zero as x goes to (±) infinity, the right-most term 
vanishes and we have 

^ = 0 
d t 

Thus, the integral N is time-independent and the normalization of l l'(x, t) does 
not change with time. 

Not all wave functions can be normalized. In such cases the quantity 
| T'(x, t)\ 2 may be regarded as the relative probability density, so that the ratio 

(*«2 

Ifihx, t) | 2 dx 

a i 


b 2 


|W(x, t)\ 2 Ax 


represents the probability that the particle will be found between a\ and ai 
relative to the probability that it will be found between b\ and hi- As an 
example, the plane wave 

w( X , t) = e (px ~ Et)lh 

does not approach zero as x approaches (±) infinity and consequently cannot 
be normalized. The probability density ^(x, t)\ 2 is unity everywhere, so that 
the particle is equally likely to be found in any region of a specified width. 


Momentum-space wave function 

The wave function 'F(x, t) may be represented as a Fourier integral, as shown 
in equation (2.7), with its Fourier transform A(p, t) given by equation (2.8). 
The transform A(p, t ) is uniquely determined by l l'(x, t) and the wave function 
*P(x, t ) is uniquely determined by A(p, t). Thus, knowledge of one of these 
functions is equivalent to knowledge of the other. Since the wave function 
W(x, t ) completely describes the physical system that it represents, its Fourier 
transform A(p, t) also possesses that property. Either function may serve as a 
complete description of the state of the system. As a consequence, we may 
interpret the quantity \A(p, t )| 2 as the probability density for the momentum at 



2.3 Expectation values of dynamical quantities 


41 


time t. By Parseval’s theorem (equation (B.28)), if 'I'(x, t) is normalized, then 
its Fourier transform A(p, t) is normalized, 


*oo 

l*F(x, t )| 2 dx 

J—OO 


•OO 

\A(p, t)\ 2 dp = 1 

— OO 


The transform A(p, t) is called the momentum-space wave function, while 
'Fix, l) is more accurately known as the coordinate-space wave function. 
When there is no confusion, however, W(x, t ) is usually simply referred to as 
the wave function. 


2.3 Expectation values of dynamical quantities 


Suppose we wish to measure the position of a particle whose wave function is 
x F(x, t). The Born interpretation of |'F(x, t)\ 2 as the probability density for 
finding the associated particle at position x at time t implies that such a 
measurement will not yield a unique result. If we have a large number of 
particles, each of which is in state \F(jc, t) and we measure the position of each 
of these particles in separate experiments all at some time t, then we will obtain 
a multitude of different results. We may then calculate the average or mean 
value (x) of these measurements. In quantum mechanics, average values of 
dynamical quantities are called expectation values. This name is somewhat 
misleading, because in an experimental measurement one does not expect to 
obtain the expectation value. 

By definition, the average or expectation value of x is just the sum over all 
possible values of x of the product of x and the probability of obtaining that 
value. Since x is a continuous variable, we replace the probability by the 
probability density and the sum by an integral to obtain 


(x) 


•OO 

x| x F(x, i)| 2 dx 

— OO 


(2.13) 


More generally, the expectation value (/(x)) of any function /(x) of the 
variable x is given by 


(/(*)) 


•OO 

/(x)l'F(x, t)\ 2 dx 

— OO 


(2.14) 


Since *F(x, /) depends on the time t, the expectation values (x) and (/(x)) in 
equations (2.13) and (2.14) are functions of t. 

The expectation value (p) of the momentum p may be obtained using the 
momentum-space wave function A(p, t) in the same way that (x) was obtained 
from 'F(x, t). The appropriate expression is 
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•oo poo 

(p) = p\A(p, t)\ 2 dp = pA*(p, t)A{p, t)dp (2.15) 

J— oo J—oo 

The expectation value ( f(p )) of any function /’( p) of p is given by an 
expression analogous to equation (2.14) 

•OO 

(./OO) = f(p)\Ap, t)\ 2 dp (2.16) 

— OO 

In general, A(p, t ) depends on the time, so that the expectation values (p) and 
( f(p )) are also functions of time. 

Both W(x, t ) and A(p, t ) contain the same information about the system, 
making it possible to find (p) using the coordinate-space wave function 
^P(x, t ) in place of A(p, t). The result of establishing such a procedure will 
prove useful when determining expectation values for functions of both 
position and momentum. We begin by taking the complex conjugate of A(p, t) 
in equation (2.8) 

1 

A*(p , 0 = -= W*(x, t)e' (px - E,)/h dx 

\/2jth J—oo 

Substitution of A*(p, t) into the integral on the right-hand side of equation 
(2.15) gives 

OO 

(p) =^L= [[ >fi*(x, t)pA(p, t)e' ipx - E,)/fl dxdp 

yljth JJ 

—oo 


= ^*(X, t) 


pA(p, t)e l{px Ei) A' dp \ dx (2.17) 


J-oo V ’ ' "J V 7 

In order to evaluate the integral over p, we observe that the derivative of 
^(x, t ) in equation (2.7), with respect to the position variable x, is 

v J -oo ^ 

Substitution of this observation into equation (2.21) gives the final result 


(P) = 


'°° fh B\ 

W*(x, t)^.—J^(x, t) dx 


(2.18) 


Thus, the expectation value of the momentum can be obtained by an integration 
in coordinate space. 

The expectation value of /r is given by equation (2.16) with /( p) = p 1 . The 
expression analogous to (2.17) is 

•oo r -j poo 

(p 2 )= w*(x,t)~== p 2 A(p, t)e iipx - Et)/h dp dx 

.. —oo V 2 jtfl «. — oo 


p 2 a( P , t)e {px - Et)lh d p \ dx 


J-oo v > 'W5dl}^r 7 J 

From equation (2.7) it can be seen that the quantity in square brackets equals 
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fh\ 2 d 2 W(x, t ) 

\i/ dx 2 

so that 

*oo / > \ 2 <-^2 

(p 2 ) = W*(x, 0(jJ , )x2 ’B(x, 0 dx (2.19) 

Similarly, the expectation value of p n is given by 

.00 / > \ yi 

(p n ) = W*(x, t) (j — J W(x, t ) dx (2.20) 

Each of the integrands in equations (2.18), (2.19), and (2.20) is the complex 
conjugate of the wave function multiplied by an operator acting on the wave 
function. Thus, in the coordinate-space calculation of the expectation value of 
the momentum p or the n th power of the momentum, we associate with p the 
operator (h / i)(d / dx). We generalize this association to apply to the expectation 
value of any function /( p) of the momentum, so that 

(f(p)) = ^(x, t)f^-)mx, t)Ax (2.21) 

Equation (2.21) is equivalent to the momentum-space equation (2.16). 

We may combine equations (2.14) and (2.21) to find the expectation value of 
a function f(x, p) of the position and momentum 

f 00 / p d \ 

{fix, p)) = 'E*(x, t)f[x, . . )x j W(x, t) dx (2.22) 

Ehrenfest’s theorems 

According to the correspondence principle as stated by N. Bohr (1928), the 
average behavior of a well-defined wave packet should agree with the classical- 
mechanical laws of motion for the particle that it represents. Thus, the 
expectation values of dynamical variables such as position, velocity, momen¬ 
tum, kinetic energy, potential energy, and force as calculated in quantum 
mechanics should obey the same relationships that the dynamical variables 
obey in classical theory. This feature of wave mechanics is illustrated by the 
derivation of two relationships known as Ehrenfest’s theorems. 

The first relationship is obtained by considering the time dependence of the 
expectation value of the position coordinate x. The time derivative of (x) in 
equation (2.13) is 
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d(x) 

d t 


a r 

d i , —oo 

\h ro ° 


x\ x V(x, t )| 2 dx = 


x^-\ x V(x, t)\ 2 dx 


2 m 


x 


d 

dx 


dW 


dW 


m* _ m 

dx dx 


dx 


where equation (2.12) has been used. Integration by parts of the last integral 
gives 


d(x) i h 


= _—x 


At 2m 


d^ 

dx dx 


1 h ro ° 

2 m 




dW* 


m* _ rp 

dx dx 


dx 


The integrated part vanishes because ^(x, t) goes to zero as x approaches (±) 
infinity. Another integration by parts of the last term on the right-hand side 
yields 

d(x) 1 


dr 


m 


00 /h d 

W*(~)WAx 

i dx 


According to equation (2.18), the integral on the right-hand side of this 
equation is the expectation value of the momentum, so that we have 

. d(x) 


(p) = m- 


At 


(2.23) 


Equation (2.23) is the quantum-mechanical analog of the classical definition of 
momentum, p = mv = m(dx/dr). This derivation also shows that the associa¬ 
tion in quantum mechanics of the operator (h/\)(d/dx) with the momentum is 
consistent with the correspondence principle. 

The second relationship is obtained from the time derivative of the expecta¬ 
tion value of the momentum (p) in equation (2.18), 


d(/>) 

dr 


d '■ 0O 

dr 




i dx 


h r 0C 
i 


1Tr * d d^\ , 

~dTlh +w lh~dt) dx 


We next substitute equations (2.11) for the time derivatives of 'T and 'T and 
obtain 


d (p) 

At 


-fi 2 d 2 w* * d fh 2 d 2 W 

J dx dx \2m dx 2 


2 m dx 2 

— ft 2 r d 2x i>* chP 


- VW 


Ax 


2 m 


fr 

dx 2 dx 2m 


xp 


d 3 W 
dx 3 


dx — 


'F*'P^dx 

dx 


(2.24) 

where the terms in V cancel. The first integral on the right-hand side of 
equation (2.24) may be integrated by parts twice to give 
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d 2 W* d^> 
, <)x 2 dx 


dx = 


OW*dW 


dx dx 


gxy* tfxy 

, dx dx 2 


dx 


>¥* dW 

. dx dx 


'V 


d 2 H> 


dx 2 


+ 


1-00 a3rn 

dx 

— OO 


dx 3 


The integrated part vanishes because 'F and (PP /dx vanish at (±) infinity. The 
remaining integral cancels the second integral on the right-hand side of 
equation (2.24), leaving the final result 


d(p) 
d t 



(2.25) 


where equation (2.1) has been used. Equation (2.25) is the quantum analog of 
Newton’s second law of motion, F = ma, and is in agreement with the 
correspondence principle. 


Heisenberg uncertainty principle 

Using expectation values, we can derive the Heisenberg uncertainty principle 
introduced in Section 1.5. If we define the uncertainties Ax and Ap as the 
standard deviations of x and p, as used in statistics, then we have 


Ax = ((x — (x)) 2 ) 1 / 2 
&P = {(P ~ {p)) 2 ) l/2 


The expectation values of x and of p at a time t are given by equations (2.13) 
and (2.18), respectively. For the sake of simplicity in this derivation, we select 
the origins of the position and momentum coordinates at time t to be the 
centers of the wave packet and its Fourier transform, so that (x) = 0 and 
(p) = 0. The squares of the uncertainties Ax and A p are then given by 


(Ax) 2 


*oo 

x 2 rp*rp dx 

— OO 


(A pY = I j 


oo a2m 

rjr* ° ^ 


dx 2 


dx = 


i 


dW 

dx 


dW* dW 
dx dx 


dx 


•OO 

—OO 


f-hd x V*\ 

\i dx ) 


(m 

\ \ dx J 


dx 


where the integrated term for (Ap) 2 vanishes because 'F goes to zero as x 
approaches (±) infinity. 

The product (AxA p) 2 is 
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(Ax A p) 2 = 


(x'V*)(xW) dx 


-hdW* 


fh dW\ 


v-s- dx 


dx ) Vi dx J 


Applying Schwarz’s inequality (A.56), we obtain 
(Ax A p) 2 


1 

h 

r 00 / ^dw , r dw*\ 

2 h 2 

r 00 d I 

_ 

4 

i . 

( x x V + xW I dx 

_oo \ dx dx J 

~~4 

i 

8 

X 

%\' 

* 

■) 

£ 
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r 

CXD 

*oo 

x xp*qr 


^P*Wdx 

L 

—oo 

—OO 


The integrated part vanishes because W goes to zero faster than 1 / yj\x\, as x 
approaches (±) infinity and the remaining integral is unity by equation (2.9). 
Taking the square root, we obtain an explicit form of the Heisenberg uncer¬ 
tainty principle 


AxA p 



(2.26) 


This expression is consistent with the earlier form, equation (1.44), but relation 
(2.26) is based on a precise definition of the uncertainties, whereas relation 
(1.44) is not. 


2.4 Time-independent Schrodinger equation 


The first step in the solution of the partial differential equation (2.6) is to 
express the wave function 'Ifix, t) as the product of two functions 

'tfix, 0 = rHx)x(t) (2.27) 


where ip(x) is a function of only the distance x and y(t) is a function of only 
the time t. Substitution of equation (2.27) into (2.6) and division by the product 
yKx)x(t) give 


i h 


1 dy(0 

X(t) d t 


fi 2 1 d 2 y>(x) 
2 mxp(x) dx 2 


(2.28) 


The left-hand side of equation (2.28) is a function only of t, while the right- 
hand side is a function only of x. Since x and t are independent variables, each 
side of equation (2.28) must equal a constant. If this were not true, then the 
left-hand side could be changed by varying t while the right-hand side 
remained fixed and so the equality would no longer apply. For reasons that will 
soon be apparent, we designate this separation constant by E and assume that 
it is a real number. 

Equation (2.28) is now separable into two independent differential equations, 
one for each of the two independent variables x and t. The time-dependent 
equation is 
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which has the solution 



Exit) 


XM) = e~ iEt/h (2.29) 

The integration constant in equation (2.29) has arbitrarily been set equal to 
unity. The spatial-dependent equation is 

h 2 d 2 U>[x) 

F(x) ^ (x) = mx) (2 - 30) 

and is called the time-independent Schrddinger equation. The solution of this 
differential equation depends on the specification of the potential energy V(x). 
Note that the separation of equation (2.6) into spatial and temporal parts is 
contingent on the potential V(x ) being time-independent. 

The wave function x ¥(x, t ) is then 

x ¥(x, t) = rp(x)Q-' Et/h (2.31) 

and the probability density |*P(x, /)| 2 is now given by 

|*P(x, t) | 2 = W*(x, tf¥(x, t) = ip*(x)e lEt/h ip(x)e- lEt/h = \tp(x )\ 2 

Thus, the probability density depends only on the position variable x and does 
not change with time. For this reason the wave function fi^x, t) in equation 
(2.31) is called a stationary state. If v F(x, t) is normalized, then rjj(x) is also 
normalized 


\ip(x)\ 2 dx = 1 (2.32) 

J—OO 

which is the reason why we set the integration constant in equation (2.29) equal 
to unity. 

The total energy, when expressed in terms of position and momentum, is 
called the Hamiltonian, H, and is given by 

I Ex, p) = ^-+ V(x) 

2 m 

The expectation value ( H) of the Hamiltonian may be obtained by applying 
equation (2.22) 

“OO 2 £)2 

(H) = ^*{x, t) V(x) X ¥(x, t)dx 

J-oo L 2m ox~ 

For the stationary state (2.31), this expression becomes 

•oo ^2 ^2 

(H) = ip*(x) -^ 0^2 + v w V(x)dx 
If we substitute equation (2.30) into the integrand, we obtain 
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poo 


< H)=E 


xp*(x)xp(x) dx = E 


J — oo 


where we have also applied equation (2.32). We have just shown that the 
separation constant E is the expectation value of the Hamiltonian, or the total 
energy for the stationary state, so that ‘ E’ is a desirable designation. Since the 
energy is a real physical quantity, the assumption that E is real is justified. 

In the application of Schrodinger’s equation (2.30) to specific physical 
examples, the requirements that xp(x) be continuous, single-valued, and square - 
integrable restrict the acceptable solutions to an infinite set of specific functions 
rpn(x), n = 1, 2, 3, ..., each with a corresponding energy value E n . Thus, the 
energy is quantized, being restricted to certain values. This feature is illustrated 
in Section 2.5 with the example of a particle in a one-dimensional box. 

Since the partial differential equation (2.6) is linear, any linear superposition 
of solutions is also a solution. Therefore, the most general solution of equation 
(2.6) for a time-independent potential energy V(x) is 


W(x, t) = c n xp n (x)& lE " f / h (2.33) 

n 

where the coefficients c n are arbitrary complex constants. The wave function 
*P(x, t) in equation (2.33) is not a stationary state, but rather a sum of 
stationary states, each with a different energy E n . 


2.5 Particle in a one-dimensional box 


As an illustration of the application of the time-independent Schrodinger 
equation to a system with a specific form for V (x), we consider a particle 
confined to a box with infinitely high sides. The potential energy for such a 
particle is given by 

V (x) = 0, 0 =£ x «£ a 

= oo, x < 0, x > a 


and is illustrated in Figure 2.1. 

Outside the potential well, the Schrodinger equation (2.30) is given by 


h 2 d 2 ip 
2 m dx 2 


F oo xp 


Exp 


for which the solution is simply xp(x) = 0; the probability is zero for finding 
the particle outside the box where the potential is infinite. Inside the box, the 
Schrodinger equation is 


h 2 d 2 xp 
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Figure 2.1 The potential energy V(x) for a particle in a one-dimensional box of length a. 


or 


d 2 ip 
dx 2 



where X is the de Broglie wavelength, 

2 7th h 
\j2mE p 


(2.34) 

(2.35) 


We have implicitly assumed here that E is not negative. If E were negative, 
then the wave function ip and its second derivative would have the same sign. 
As |x| increases, the wave function ip(x) and its curvature d 2 ip/dx 2 would 
become larger and larger in magnitude and ip(x) would approach (±) infinity 
as x —> oo. 

The solutions to equation (2.34) are functions that are proportional to their 
second derivatives, namely sin(2jrx/2) and cos(2jrx/A). The functions 
exp[2jrix/>i] and exp[— 2jtbc/X], which as equation (A.31) shows are equivalent 
to the trigonometric functions, are also solutions, but are more difficult to use 
for this system. Thus, the general solution to equation (2.34) is 

, . 2 7tx „ 2jtx 

ip(x) = A sin——hScos^— (2.36) 

A A 


where A and B are arbitrary constants of integration. 

The constants A and B are determined by the boundary conditions placed on 
the solution ip(x). Since ip(x) must be continuous, the boundary conditions 
require that ip(x) vanish at each end of the box so as to match the value of ip(x) 
outside the box, i.e., t/;(0) = ip(a) = 0. At x = 0, the function ip( 0) from (2.36) 
is 


so that B 


ip( 0) = A sin 0 + B cos 0 = 5 


0 and ip(x) is now 


ip(x) 


A sin 


2jtx 

~T~ 


(2.37) 


At x = a, ip{a) is 
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ip(a) 


A sin 


Ina 

~k 


0 


The constant A cannot be zero, for then xp(x) would vanish everywhere and 
there would be no particle. Consequently, we have sin(2jra/A) = 0 or 
2 jici 

. = rm, n = 1, 2, 3, ... 


where n is any positive integer greater than zero. The solution n = 0 would 
cause ip(x) to vanish everywhere and is therefore not acceptable. Negative 
values of n give redundant solutions because sin(— 6) equals —sin 0. 

We have found that only distinct values for the de Broglie wavelength satisfy 
the requirement that the wave function represents the motion of the particle. 
These distinct values are denoted as X n and are given by 

2 ^ 

X n = —, n = 1,2,3,... (2.38) 

n 

Consequently, from equation (2.35) only distinct values E„ of the energy are 
allowed 


n 2 JT 2 h 2 n 2 h 2 
2 ma 2 8 ma 2 ’ 


n = 1, 2, 3, ... 


(2.39) 


so that the energy for a particle in a box is quantized. 

The lowest allowed energy level is called the zero-point energy and is given 
by E\ = h 2 /8 ma 2 . This zero-point energy is always greater than the zero value 
of the constant potential energy of the system and increases as the length a of 
the box decreases. The non-zero value for the lowest energy level is related to 
the Heisenberg uncertainty principle. For the particle in a box, the uncertainty 
Ax in position is equal to the length a since the particle is somewhere within 
the box. The uncertainty Ap in momentum is equal to 2\p\ since the 
momentum ranges from — \p\ to \p\. The momentum and energy are related by 

\p\ = y/lmE = — 


so that 


AxA p = nh 

is in agreement with the Heisenberg uncertainty principle (2.26). If the lowest 
allowed energy level were zero, then the Heisenberg uncertainty principle 
would be violated. 

The allowed wave functions t/’„(x) for the particle in a box are obtained by 
substituting equation (2.38) into (2.37), 

. nzcx 

ip n (x) = A sm-, 0 =£ x *£ a 


a 
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The remaining constant of integration A is determined by the normalization 
condition (2.32), 


\ipn(x)\ 2 dx = \A \ 2 


sim-dx = \A\ 2 — 

a Jt 


o 


sin 2 n0d0 = \A\ 2 - = 1 
o 2 


where equation (A. 15) was used. Therefore, we have 



Setting the phase a equal to zero since it has no physical significance, we 
obtain for the normalized wave functions 


Xpnix) = 

= 0, x < 0, x > a 

The time-dependent Schrodinger equation (2.30) for the particle in a box has 
an infinite set of solutions ip n (x) given by equation (2.40). The first four wave 
functions ip „(x) for n = 1, 2, 3, and 4 and their corresponding probability 
densities \ip n {x)\ 2 are shown in Figure 2.2. The wave function f | (x) corre¬ 
sponding to the lowest energy level E\ is called the ground state. The other 
wave functions are called excited states. 

If we integrate the product of two different wave functions ?/>/(x) and i/;„(x), 
we find that 


2 . nxcx 

- sin-, 

a a 


(2.40) 


>a 

rp i(x)r/j n (x) dx 
o 


2 . ( Ijtx\ . (njtxA 

- sin - sin - dx 

a Jo \ a J \ a J 


2 

— sin/0 sin d0 = 0 
71 Jo 


(2.41) 


where equation (A. 15) has been introduced. This result may be combined with 
the normalization relation to give 


>a 

Xpl(x)Xp n {x) dx 


0 


where 6 /„ is the Kronecker delta, 


Sin 


5 in = 1 , l = n 

= 0, / n 


(2.42) 


(2.43) 


Functions that obey equation (2.41) are called orthogonal functions. If the 
orthogonal functions are also normalized, as in equation (2.42), then they are 







Figure 2.2 Wave functions xp t and probability densities \ip t \ 2 for a particle in a one¬ 
dimensional box of length a. 


said to be orthonormal. The orthogonal property of wave functions in quantum 
mechanics is discussed in a more general context in Section 3.3. 

The stationary states T^x, t) for the particle in a one-dimensional box are 
given by substitution of equations (2.39) and (2.40) into (2.31), 

W(x, sin (2.44) 

The most general solution (2.33) is, then, 

—i(n 2 jt 2 h/2ma 2 )t 

c 



W(X, 0 


(2.45) 
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2.6 Tunneling 

As a second example of the application of the Schrodinger equation, we 
consider the behavior of a particle in the presence of a potential barrier. The 
specific form that we choose for the potential energy V(x) is given by 

V(x) = Vq, 0 s£ x «£ a 


= 0, x < 0, x > a 


and is shown in Figure 2.3. The region where x<0 is labeled I, where 
0 x s£ a is labeled II, and where x > a is labeled III. 

Suppose a particle of mass m and energy E coming from the left approaches 
the potential barrier. According to classical mechanics, if E is less than the 
barrier height Vq, the particle will be reflected by the barrier; it cannot pass 
through the barrier and appear in region III. In quantum theory, as we shall see, 
the particle can penetrate the barrier and appear on the other side. This effect is 
called tunneling. 

In regions I and III, where V(x) is zero, the Schrodinger equation (2.30) is 


d 2/ ip(x) 

dx 2 


2mE 


ip(x) 


(2.46) 


The general solutions to equation (2.46) for these regions are 

ipi = Ae iax + Be~ iax (2.47 a) 

Vm = Fe iax + Ge~ iax (2.47 b) 


where A, B, F, and G are arbitrary constants of integration and a is the 
abbreviation 


In region II, where V (x) 


a = 


\j2mE 

fi 


(2.48) 


Vq > E, the Schrodinger equation (2.30) becomes 



Figure 2.3 Potential energy barrier of height Vq and width a. 
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d 2 f(x) 2 m 

-&r = - m(x) 

for which the general solution is 

fu = CeP x + De~ fix 

where C and D are integration constants and ( J > is the abbreviation 

\/2 m(Vo-E) 


P = 


h 


(2.49) 


(2.50) 


(2.51) 


The term exp[iax] in equations (2.47) indicates travel in the positive .in¬ 
direction, while exp[—iax] refers to travel in the opposite direction. The 
coefficient A is, then, the amplitude of the incident wave, B is the amplitude of 
the reflected wave, and F is the amplitude of the transmitted wave. In region 
III, the particle moves in the positive x-direction, so that G is zero. The relative 
probability of tunneling is given by the transmission coefficient T 

1*1 2 


T = 


M P 


(2.52) 


and the relative probability of reflection is given by the reflection coefficient R 

\B \ 2 


R = 


MF 


(2.53) 


The wave function for the particle is obtained by joining the three parts fi, 
(/'ii, and fin such that the resulting wave function f(x) and its first derivative 
f'(x) are continuous. Thus, the following boundary conditions apply 

fi(0) = fn(0), f{(0) = fii(0) (2.54) 


fiu(a) = fin (o), f[i (a) = f'm (a) 


(2.55) 


These four relations are sufficient to determine any four of the constants A, B, 
C, D, F in terms of the fifth. If the particle were confined to a finite region of 
space, then its wave function could be normalized, thereby determining the fifth 
and final constant. However, in this example, the position of the particle may 
range from — oo to oo. Accordingly, the wave function cannot be normalized, 
the remaining constant cannot be evaluated, and only relative probabilities such 
as the transmission and reflection coefficients can be determined. 

We first evaluate the transmission coefficient T in equation (2.52). Applying 
equations (2.55) to (2.47 b) and (2.50), we obtain 

Ce /ia + De~ pa = Fe iaa 
- DQ~ pa ) = i aFe iaa 


from which it follows that 
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C = F 


D = F 


/3 + i a 
— i a 

w 


_,(i a—fl)a 


Jia+(S)a 
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(2.56) 


(2.57) 


Application of equation (2.54) to (2.47 a) and (2.50) gives 

A+B=C+D 
i a(A - B) = 0(C - D) 

Elimination of B from the pair of equations (2.57) and substitution of equations 
(2.56) for C and for D yield 

A = ^[(j3 + ia)C-(j3-ia)D] 

2i a 


= F 


; [(P + ia)“e 


2 “~ fia — (J3 — i a)V fl ] 


4i a/3 

At this point it is easier to form \A\ 2 before any further algebraic simplifica¬ 


tions 

\a\ 2 = a*a 


= \F\ 7 


= 1 Ff 


= |F | 2 


16a 2 /T 

1 


■ [( ft 2 + a 2 ) 2 e 2/3a + (J5 2 + a 2 ) 2 e 2/3a — (fi — ia) 4 - (/? + ia) 4 ] 


1 + 


[(a 2 + /8 2 ) 2 (e^ a - e _/3a ) 2 + 1 6a 2 /3 2 ] 
sinlr [A a 


16a 2 /3 2 

(« 2 +/ 32 ) 2 ..-„u2 


4a 2 /3 2 


where equation (A.46) has been used. Combining this result with equations 
(2.48), (2.51), and (2.52), we obtain 

-l 


T = 


1 + 


Vi 


4E( Vo - E) 


sinh 2 (Y / 2m(Fo — E) a/h) 


(2.58) 


To find the reflection coefficient R , we eliminate A from the pair of 
equations (2.57) and substitute equations (2.56) for C and for D to obtain 

i pi aa 

B = — [-03 - ia)C + 03 + i o)D] = F-—[(a 2 + /SV° - (a 2 + /3 2 )e"^] 
2ia 4i ap 

pi aa 

= F— — -(a 2 + /? 2 )sinh /5 a 
2i ap 



56 


Schrodinger wave mechanics 


where again equation (A.46) has been used. Combining this result with 
equations (2.52) and (2.53), we find that 

\B\ 2 ( a 2 +/3 2 f . l2 „ 

R = smh 2 fia 

\F\ 2 4 a 2 p 2 ' 

Substitution of equations (2.48), (2.51), and (2.58) yields 


Vi 


R = 


4E( V () - E) 


sinh 2 (Y / 2m(Fo — E) a/h) 


1 + 


Vi 


4E(Vo - E) 


sinh 2 (y/2m(Vo — E) a/h ) 


(2.59) 


The transmission coefficient T in equation (2.58) is the relative probability 
that a particle impinging on the potential barrier tunnels through the barrier. 
The reflection coefficient R in equation (2.59) is the relative probability that 
the particle bounces off the barrier and moves in the negative x-direction. Since 
the particle must do one or the other of these two possibilities, the sum of T 
and R should equal unity 

T + R = 1 


which we observe from equations (2.58) and (2.59) to be the case. 

We also note that the (relative) probability for the particle being in the region 
0 s£ x a is not zero. In this region, the potential energy is greater than the 
total particle energy, making the kinetic energy of the particle negative. This 
concept is contrary to classical theory and does not have a physical signifi¬ 
cance. For this reason we cannot observe the particle experimentally within the 
potential barrier. Further, we note that because the particle is not confined to a 
finite region, the boundary conditions on the wave function have not imposed 
any restrictions on the energy E. Thus, the energy in this example is not 
quantized. 

In this analysis we considered the relative probabilities for tunneling and 
reflection for a single particle. The conclusions apply equally well to a beam of 
particles, each of mass m and total energy E, traveling initially in the positive 
x-direction. In that case, the transmission coefficient Tin equation (2.58) gives 
the fraction of incoming particles that tunnel through the barrier, and the 
reflection coefficient R in equation (2.59) gives the fraction that are reflected 
by the barrier. 

If the potential barrier is thick (a is large), the potential barrier is high 
compared with the particle energy E (V, o> E), the mass m of the particle is 
large, or any combination of these characteristics, then we have 

sinh/3fl = l -y a - Q-y « ^ 



2.7 Particles in three dimensions 


57 


so that T and R become 



In the limit as a —■» oo, as F 0 —► oo, as m —> oo, or any combination, the 
transmission coefficient T approaches zero and the reflection coefficient R 
approaches unity, which are the classical-mechanical values. We also note that 
in the limit h —* 0, the classical values for T and R are obtained. 

Examples of tunneling in physical phenomena occur in the spontaneous 
emission of an alpha particle by a nucleus, oxidation-reduction reactions, 
electrode reactions, and the umbrella inversion of the ammonia molecule. For 
these cases, the potential is not as simple as the one used here, but must be 
selected to approximate as closely as possible the actual potential. However, 
the basic qualitative results of the treatment here serve to explain the general 
concept of tunneling. 


2.7 Particles in three dimensions 


Up to this point we have considered particle motion only in the x-direction. 
The generalization of Schrodinger wave mechanics to three dimensions is 
straightforward. In this section we summarize the basic ideas and equations of 
wave mechanics as expressed in their three-dimensional form. 

The position of any point in three-dimensional cartesian space is denoted by 
the vector r with components x, y, z, so that 

r = ix + j y + k z (2.60) 

where i, j, k are, respectively, the unit vectors along the x, y, z cartesian 
coordinate axes. The linear momentum p of a particle of mass m is given by 


dr 

p = m—= m 
d t 


.dx .dv 

'd7 + J d7 + k 



i Px +\Pv + k/>. 


(2.61) 


The x-component, p x , of the momentum now needs to carry a subscript, 
whereas before it was denoted simply as p. The scalar or dot product of r and 
p is 

r • P = P * r = xp x + yp y + zp z 

and the magnitude p of the vector p is 

P = (P * P)‘ /2 = ip\ + P 2 y + F 2 ) 1/2 
The classical Hamiltonian H{ p, r) takes the form 
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H( p, r) = + V(r) = - 1 - (p\ + p\ + pi) + F(r) 

2 m 2 m y 


(2.62) 


When expressed in three dimensions, the de Broglie relation is 

p = Kk (2.63) 

where k is the vector wave number with components k x , k v , k z . The de Broglie 
wavelength X is still given by 

, In h 

l = T = p 

where now k and p are the magnitudes of the corresponding vectors. The wave 
packet representing a particle in three dimensions is 

. 1 f .. . , .. 


^(r, 0 = 


A(p, t)e 


i(p-r -Et)/h 


(2.64) 


~ (2nh)V* ) 

As shown by equations (B.19), (B.20), and (B.27), the momentum-space wave 
function A( p, t) is a generalized Fourier transform of 'l ; (r, /), 


A(V, t) = j^j- 2 jw(r, dr 


(2.65) 


The volume elements dr and dp are defined as 

dr = dx d y dz 
dp = d p x d p y d p z 

and the integrations extend over the complete range of each variable. 

For a particle moving in three-dimensional space, the quantity 

'F ; (r, /)T ; (r, /) dr = 'F ; (x, y, z, t) x P(jc, y, z, t) dx dy dz 

is the probability at time t of finding the particle with its x-coordinate between 
x and x + dx, its y-coordinate between y and y + dy, and its z-coordinate 
between z and z + dz. The product v F*(r, /)'F(r t) is, then, the probability 
density at the point r at time t. If the particle is under the influence of an 
external potential field V (r), the wave function v F(r, t) may be normalized 


^P*(r, /)'F(r, /) dr = 1 


( 2 . 66 ) 


The quantum-mechanical operators corresponding to the components of p 


or, in vector notation 


(2.67) 
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where the gradient operator V is defined as 

_ . <9 . d d 

^ = i np.+ j ^ + te§T 
ox oy oz 

Using these relations, we may express the Hamiltonian operator in three 
dimensions as 

H = ^-V 2 +V(r) 

2 m 

where the laplacian operator V 2 is defined by 

72 — 


2 d 2 d 2 d 2 

V / = V- V = — + — + 


dx 2 dy 2 ' dz 2 

The time-dependent Schrodinger equation is 

ih a ^Pl=imr,l) 


dt 


—ft 2 
2 m 


V 2 ’l ; (r, t) + F(r)'t'(r, t ) 


( 2 . 68 ) 

(2.69) 


The stationary-state solutions to this differential equation are 

Wn(r, t ) = xp n (r)e~ iE ^ h 

where the spatial functions V;;( r ) are solutions of the time-independent 
Schrodinger equation 

-h 2 , 


2 m 


V ip n ( r) + V(r)ip n (r) = E n xp n (r ) 


The most general solution to equation (2.68) is 

^(r, i) = J2cnVn(r)e- iE " t/h 


(2.70) 


(2.71) 


where c n are arbitrary complex constants. 

The expectation value of a function /(r, p) of position and momentum is 
given by 


</(r, P)> 


W*(r, t)f 



^(r, t ) dr 


(2.72) 


Equivalently, expectation values of three-dimensional dynamical quantities 
may be evaluated for each dimension and then combined, if appropriate, into 
vector notation. For example, the two Ehrenfest theorems in three dimensions 


are 
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(P) 


= m 


d(r) 

d t 


d(p) 

At 


~(W) = (F) 


where F is the vector force acting on the particle. The Heisenberg uncertainty 
principle becomes 

ft ft ft 

AxAp x =s -, AyAp y 3= -, AzA p z 3= - 


Multi-particle system 

For a system of N distinguishable particles in three-dimensional space, the 
classical Hamiltonian is 

p\ p 2 N 

H( pi,p 2 , r i> r 2, = + -F - - 

2ni\ 2 m 2 2% 

+ F(ri, r 2 , , r N ) 

where r* and p/ are the position and momentum vectors of particle k. Thus, 
the quantum-mechanical Hamiltonian operator is 

H = —— f Vj 4 V 2 +•■•-! + ^( r i> r 2, ■ ■ ■, i-.v) (2.73) 

2 \m 1 m 2 m^j J 

where V 2 k is the laplacian with respect to the position of particle k. 

The wave function for this system is a function of the N position vectors: 
^(ri, r 2 , ..., r N , t). Thus, although the N particles are moving in three- 
dimensional space, the wave function is 3/V-dimensional. The physical inter¬ 
pretation of the wave function is analogous to that for the three-dimensional 
case. The quantity 

x P*(r 1 , r 2 , ..., r N , tyU(r u r 2 , ..., r N , t)dri dr 2 ... dr N 

= W*(x u y 1 , zi, x 2 , ..., z N )W(x 1 , y u z u x 2 , ..., z N ) cbc, dyi dzi dx 2 ... d z N 

is the probability at time t that, simultaneously, particle 1 is between x\, y\, z\ 
and xi + dxi, y t + dyi, z\ + dzi, particle 2 is between jc 2 , y 2 , z 2 and x 2 + dx 2 , 
y 2 + dy 2 , z 2 + dz 2 , ..., and particle N is between x,v, yx, z N and x,v + dxy, 
Vn + d_w, z,v + dz v . The normalization condition is 

x P*(ri, r 2 , ..., r N , ^(r,, r 2 , ..., r, v , t) dr, dr 2 ... dr w = 1 (2.74) 

This discussion applies only to systems with distinguishable particles; for 
example, systems where each particle has a different mass. The treatment of 
wave functions for systems with indistinguishable particles is more compli- 
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cated and is discussed in Chapter 8. Such systems include atoms or molecules 
with more than one electron, and molecules with two or more identical nuclei. 


2.8 Particle in a three-dimensional box 


A simple example of a three-dimensional system is a particle confined to a 
rectangular container with sides of lengths a, b, and c. Within the box there is 
no force acting on the particle, so that the potential V(r) is given by 

V (r) = 0, 0 x *£ a, 0 y b, 0 s£ z c 

= oo, x < 0, x > a; y< 0, y> b; z< 0, z> c 

The wave function ip(r) outside the box vanishes because the potential is 
infinite there. Inside the box, the wave function obeys the Schrodinger equation 
(2.70) with the potential energy set equal to zero 


—it 1 ( d 2 ip( r) | d 2 y>(r) | d 2 y>(r) 
~f~ n T f 


2 m \ Ox 2 


dy 2 


dz 2 


= Exp{ r) 


(2.75) 


The standard procedure for solving a partial differential equation of this type is 
to assume that the function xp( r) may be written as the product of three 
functions, one for each of the three variables 

V'OO = r P( x ’ T> z ) = X( x )Y(y)Z(z) (2.76) 

Thus, X(x) is a function only of the variable x, Y ( y) only of y, and Z(z) only of 
z. Substitution of equation (2.76) into (2.75) and division by the product XYZ 
give 

-h 2 d 2 X -ft 2 d 2 Y -ft 2 d 2 Z 


+ 


+ 


= E 


(2.77) 


2 mX dx 2 2m Y dy 2 2mZ dz 2 
The first term on the left-hand side of equation (2.77) depends only on the 
variable x, the second only on y, and the third only on z. No matter what the 
values of x, or y, or z, the sum of these three terms is always equal to the same 
constant E. The only way that this condition can be met is for each of the three 
terms to equal some constant, say E x , E y , and E z , respectively. The partial 
differential equation (2.77) can then be separated into three equations, one for 
each variable 


d 2 X 2m ^ „ n 

~d^ + W ExX -°’ 


d 2 7 


dy 2 


2 m 

v E ’ r = 0 - 


d 2 Z 2m 


dz 2 h 2 


E-Z = 0 


where 


E x + E v + E z — E 


(2.78) 


( 2 . 79 ) 
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Thus, the three-dimensional problem has been reduced to three one-dimen¬ 
sional problems. 

The differential equations (2.78) are identical in form to equation (2.34) and 
the boundary conditions are the same as before. Consequently, the solutions 
inside the box are given by equation (2.40) as 



[l . n x Jix 



X(x) = ^ 

J sin 

n x = 1, 2, 3, ... 



V a a 



Y(y) = 1 

12 . n v Jtv 

tb Sm b ’ 

n y = 1, 2, 3, ... 

(2.80) 


2 . n z Jiz 

Z(z) = y- sin-, 

Vc c 


n- = 1, 2, 3, 


and the constants E x , E y , E z are given by equation (2.39) 

E x — Q 2 ’ 1, 2, 3, ... 

t$ma z 


n 2 h 2 
F = _j_ _ 

y 87 nb 2 ’ 


n y = 1, 2, 3, ... 


(2.81) 



The quantum numbers n x , n y , n z take on positive integer values independently 
of each other. Combining equations (2.76) and (2.80) gives the wave functions 
inside the three-dimensional box 


f n x ,n y ,n z ( r ) 



n v Jt y . n z Jtz 

sin ——— sin- 

b c 


(2.82) 


where v = abc is the volume of the box. The energy levels for the particle are 
obtained by substitution of equations (2.81) into (2.79) 


E, 


8 772 



(2.83) 


Degeneracy of energy levels 
If the box is cubic, we have a = 

77 

^n x ,n y ,n z 


b = c and the energy levels become 


fnl + nl+n 2 ) 


(2.84) 


The lowest or zero-point energy is £ 1 , 1,1 = 3 h 2 /%ma 2 , which is three times the 
zero-point energy for a particle in a one-dimensional box of the same length. 
The second or next-highest value for the energy is obtained by setting one of 
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Table 2.1. Energy levels for a particle in a three- 
dimensional box with a = b = c 


Energy Degeneracy 

Values of n x , n v , n : 

3(h 2 /8ma 2 ) 

1 

1 ,U 



6(h 2 /8ma 2 ) 

3 

2 , 1,1 

1 , 2,1 

1 , 1,2 

9(h 2 /Sma 2 ) 

3 

2 , 2,1 

2 , 1,2 

1 , 2,2 

1 \(h 2 l&ma 2 ) 

3 

3,1,1 

1,3,1 

1,1,3 

\2(hr Uma 2 ) 

1 

2 , 2,2 



I4(h 2 /Sma 2 ) 

6 

3,2,1 

3,1,2 

2,3,1 2,1,3 1,3,2 1,2,3 


Table 2.2. Energy levels for 
dimensional box with b 

a particle in a three- 
= a^, c = a/3 

Energy 

Degeneracy 

Values of n x , n y , n : 

\4(h 2 Uma 2 ) 

1 

1,U 

\l{h 2 /%ma 2 ) 

1 

2 , 1,1 

22(h 2 /Sma 2 ) 

1 

3,U 

26{h 2 ISma 2 ) 

1 

1 , 2,1 

29(hr Uma 2 ) 

2 

2,2,1 4,1,1 

3\(h 2 l%ma 2 ) 

1 

3,2,1 

38(/? 2 /8»?a 2 ) 

1 

5,U 

41(/r/8ma 2 ) 

2 

1,1,2 4,2,1 


the integers n x , n y , n : equal to 2 and the remaining ones equal to unity. Thus, 
there are three ways of obtaining the value 6h 2 /Sma 2 , namely, E 2 , 1 , 1 , £ 1 , 2 , 1 , 
and £ 1 , 1 , 2 . Each of these three possibilities corresponds to a different wave 
function, respectively, Vfi,2,i(r)> and ?/' 1 , 1,2 (r). An energy level that 

corresponds to more than one wave function is said to be degenerate. The 
second energy level in this case is threefold or triply degenerate. The zero- 
point energy level is non-degenerate. The energies and degeneracies for the 
first six energy levels are listed in Table 2.1. 

The degeneracies of the energy levels in this example are the result of 
symmetry in the lengths of the sides of the box. If, instead of the box being 
cubic, the lengths of b and c in terms of a were b = a/2, c = a/3, then the 
values of the energy levels and their degeneracies are different, as shown in 
Table 2.2 for the lowest eight levels. 

Degeneracy is discussed in more detail in Chapter 3. 
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Problems 

2.1 Consider a particle in a one-dimensional box of length a and in quantum state n. 
What is the probability that the particle is in the left quarter of the box 
(0 =£ x =£ a/4)? For which state n is the probability a maximum? What is the 
probability that the particle is in the left half of the box (0 =s x a/2)1 

2.2 Consider a particle of mass m in a one-dimensional potential such that 

V(x) — 0, —a/2 =£ x ss a/2 

— oo, x<—a/2, x> a/2 

Solve the time-independent Schrodinger equation for this particle to obtain the 
energy levels and the normalized wave functions. (Note that the boundary 
conditions are different from those in Section 2.5.) 

2.3 Consider a particle of mass m confined to move on a circle of radius a. Express 
the Hamiltonian operator in plane polar coordinates and then determine the energy 
levels and wave functions. 

2.4 Consider a particle of mass m and energy E approaching from the left a potential 
barrier of height Vo, as shown in Figure 2.3 and discussed in Section 2.6. However, 
suppose now that E is greater than Vo (E > Vo). Obtain expressions for the 
reflection and transmission coefficients for this case. Show that T equals unity 
when E — Vo — n 2 Ji 2 h 2 /2ma 2 for n — 1 , 2 ,... Show that between these periodic 
maxima T has minima which lie progressively closer to unity as E increases. 

2.5 Find the expression for the transmission coefficient T for Problem 2.4 when the 
energy E of the particle is equal to the potential barrier height Vq. 



3 

General principles of quantum theory 


3.1 Linear operators 

The wave mechanics discussed in Chapter 2 is a linear theory. In order to 
develop the theory in a more formal manner, we need to discuss the properties 
of linear operators. An operator A is a mathematical entity that transforms a 
function xp into another function 0 

0 = Aip (3.1) 


Throughout this book a circumflex is used to denote operators. For example, 
multiplying the function xp(x) by the variable x to give a new function <p(x) 
may be regarded as operating on the function xp(x) with the operator x, where x 
means multiply by x: 0(x) = xxp(x) = xxp(x). Generally, when the operation is 
simple multiplication, the circumflex on the operator is omitted. The operator 
D x , defined as d/dx, acting on xp(x) gives the first derivative of ip(x) with 
respect to x, so that in this case 


0 


D x ip 


d xp 
dx 


The operator A may involve a more complex procedure, such as taking the 
integral of xp with respect to x either implicitly or between a pair of limits. 

The operator A is linear if it satisfies two criteria 


i( 0 i + xp 2 ) = Axpi + Axp 2 


(3.2a) 


A(cxp) = c Axp 


(3.2b) 


where c is any complex constant. In the three examples given above, the 
operators are linear. Some nonlinear operators are ‘exp’ (take the exponential 
of) and [ ] 2 (take the square of), since 


65 



66 


General principles of quantum theory 

pX+y — P x y 


= e x e- v fe x + e y 
e cx f ce x 

[.x + y f = x 2 + 2 xy + y 2 f x 2 + y 2 
[c(x + y)f f c[x + y] 2 

The operator C is the sum of the operators A and B if 
Cxp = (A + B)xp = Arp + Bxp 
The operator C is the product of the operators A and B if 

Cxp = ABxp = A(Brp) 

where first B operates on rp and then A operates on the resulting function. 
Operators obey the associative law of multiplication, namely 

A(BC) = ( AB)C 

Operators may be combined. Thus, the square A 2 of an operator A is just the 
product AA 

A 2 ip = AAxp = A(Arp) 

Similar definitions apply to higher powers of A. As another example, the 
differential equation 

may be written as (£> 2 + k 2 )y = 0, where the operator (D 2 + k 2 ) is the sum of 
the two product operators Z) 2 and k 2 . 

In multiplication, the order of A and B is important because ABrp is not 

necessarily equal to BAip. For example, if A = x and B = D x , then we have 

ABxp = xD x rp = x(drp/dx) while, on the other hand, BAip = D x (xrp) = 
ip + x(drp/dx). The commutator of A and B, written as [A, B ], is an operator 
defined as 

[A, B]=AB- BA (3.3) 

from which it follows that [A, B] = — [B, A], If ABxp = BAxp, then we have 
AB = BA and [A, B] = 0; in this case we say that A and B commute. By 
expansion of each side of the following expressions, we can readily prove the 
relationships 

[A, BC] = [A, B]C + B[A, C] (3.4a) 

[AB, C] = [A, C]B + A[B, C] (3.4b) 

The operator A is the reciprocal of B if AB = BA = 1, where 1 may be 
regarded as the unit operator, i.e., ‘multiply by unity’. We may write A = B 1 
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and B = A~ l . If the operator A possesses a reciprocal, it is non-singular, in 
which case the expression 0 = Axp may be solved for 0, giving ip = A~ l (p. If 
A possesses no reciprocal, it is singular and the expression 0 = Aip may not be 
inverted. 


3.2 Eigenfunctions and eigenvalues 

Consider a finite set of functions f ) and the relationship 

c\f\ + C 2/2 + ■ ■ ■ + c„fn = 0 

where c\, C 2 , ■ ■ ■ are complex constants. If an equation of this form exists, then 
the functions are linearly dependent. However, if no such relationship exists, 
except for the trivial one with c\ = C 2 = ■ ■ ■ = c n = 0, then the functions are 
linearly independent. This definition can be extended to include an infinite set 
of functions. 

In general, the function 0 obtained by the application of the operator A on 
an arbitrary function ip, as expressed in equation (3.1), is linearly independent 
of 0. However, for some particular function ip\, it is possible that 

-401 = ai0i 

where cq is a complex number. In such a case ip\ is said to be an eigenfunction 
of A and cq is the corresponding eigenvalue. For a given operator A, many 
eigenfunctions may exist, so that 

Axpi = CLiipj (3.5) 

where 0, are the eigenfunctions, which may even be infinite in number, and a, 
are the corresponding eigenvalues. Each eigenfunction of A is unique, that is to 
say, is linearly independent of the other eigenfunctions. 

Sometimes two or more eigenfunctions have the same eigenvalue. In that 
situation the eigenvalue is said to be degenerate. When two, three, ..., n 
eigenfunctions have the same eigenvalue, the eigenvalue is doubly, triply, ..., 
n-fold degenerate. When an eigenvalue corresponds only to a single eigenfunc¬ 
tion, the eigenvalue is non-degenerate. 

A simple example of an eigenvalue equation involves the operator D x 
mentioned in Section 3.1. When D x operates on c kx , the result is 

D,e fe = ^e kx = A'e fa " 

dx 

Thus, the exponentials c kx are eigenfunctions of D x with corresponding 
eigenvalues k. Since both the real part and the imaginary part of k can have 
any values from —00 to + 00 , there are an infinite number of eigenfunctions 
and these eigenfunctions form a continuum of functions. 
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Another example is the operator D 2 X acting on either sin nx or cos nx, where 
n is a positive integer (n 3= 1), for which we obtain 

D 2 x sin nx = — n 2 sin nx 
D 2 cos nx = — n 2 cos nx 

The functions sin nx and cos nx are eigenfunctions of D 2 with eigenvalues 
— n 2 . Although there are an infinite number of eigenfunctions in this example, 
these eigenfunctions form a discrete, rather than a continuous, set. 

In order that the eigenfunctions ip, have physical significance in their 
application to quantum theory, they are chosen from a special class of func¬ 
tions, namely, those which are continuous, have continuous derivatives, are 
single-valued, and are square integrable. We refer to functions with these 
properties as well-behaved functions. Throughout this book we implicitly 
assume that all functions are well-behaved. 


Scalar product and orthogonality 

The scalar product of two functions 0(x) and 0(x) is defined as 

•OO 

0*(x)0(x) dx 

J—OO 

For functions of the three cartesian coordinates x, y, z, the scalar product of 
0(x, y, z ) and 0(x, y, z) is 

•OO 

0*(x, y, z)xp{x, y, z)dxdydz 

— OO 


For the functions ip(r, 6, <p) and 0(r, 6, <p) of the spherical coordinates r, 0, 
tp, the scalar product is 


*2jt 

*7T 

o . 

o. 


( p*(r , 6, (p)tp(r, 6, tp)r 2 sinOdrdOdcp 

In order to express equations in general terms, we adopt the notation J dr to 
indicate integration over the full range of all the coordinates of the system 
being considered and write the scalar product in the form 


0*0 dr 


For further convenience we also introduce a notation devised by Dirac and 
write the scalar product of 0 and 0 as (0 | ip), so that 


(0 \f) = 


0*0 dr 


The significance of this notation is discussed in Section 3.6. From the definition 
of the scalar product and of the notation (0 | ip), we note that 
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(0 I f)* = (0l0> 

(0 | exp) = c(0 | 0) 

(c0 | Xp > = C*(0|0) 

where c is an arbitrary complex constant. Since the integral {xp\xp)* equals 
(xp | xp), the scalar product (xp \ xp) is real. 

If the scalar product of xp and 0 vanishes, i.e., if (0 | xp) = 0, then xp and 0 
are said to be orthogonal. If the eigenfunctions xpj of an operator A obey the 
expressions 

(xpj | xpi) = 0 all i, j with i f j 

the functions xp t form an orthogonal set. Furthermore, if the scalar product of 
xpj with itself is unity, the function xpj is said to be normalized. A set of 
functions which are both orthogonal to one another and normalized are said to 
be orthonormal 


(xpj | xpi) = d v 

where dy is the Kronecker delta function, 

<hj = 1, i = j 

= 0, if j 


(3.6) 

(3.7) 


3.3 Hermitian operators 


The linear operator A is hermitian with respect to the set of functions xpj of the 
variables q\, q 2 ,... if it possesses the property that 


xp*Axpj dr 


xpfAxpjf dr 


(3.8) 


The integration is over the entire range of all the variables. The differential dr 
has the form 

dr = w(q i, q 2 , •. .)d#i dq 2 ■ ■■ 

where w(q \, q 2 , ...) is a weighting function that depends on the choice of the 
coordinates q\, q 2 , ... For cartesian coordinates the weighting function 
w(x, y, z) equals unity; for spherical coordinates, w(r, 6, <p) equals r 2 sin 0. 
Special variables introduced to simplify specific problems have their own 
weighting functions, which may differ from unity (see for example Section 
6.3). Equation (3.8) may also be expressed in Dirac notation 

(xpj\Axpi) = (Axpj\xpi) (3.9) 

in which the brackets indicate integration over all the variables using their 
weighting function. 
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For illustration, we consider some examples involving only one variable, 
namely, the cartesian coordinate x, for which w(x) = 1. An operator that results 
in multiplying by a real function f(x) is hermitian, since in this case 
fix)* = /(x) and equation (3.8) is an identity. Likewise, the momentum 
operator p = (h/ i)(d/dx), which was introduced in Section 2.3, is hermitian 
since 


ip*pipi dx 


* h&xpi h * 

A7 47 dx = 7^' 


h 

i 


fr 


dx 


J dx 


The integrated part is zero if the functions xpj vanish at infinity, which they 
must in order to be well-behaved. The remaining integral is J ipiP*ip* dx, so 
that we have 


•OO 

ip* pipi dx 

— OO 


*oo 

xpiipxpjf dx 

—OO 


The imaginary unit i contained in the operator p is essential for the 
hermitian character of that operator. The operator D x = d/dx is not hermitian 
because 


, * dipt 

f ±7 dx = - 


ft 


d 

dx 


1 dx 


(3.10) 


where again the integrated part vanishes. The negative sign on the right-hand 
side of equation (3.10) indicates that the operator is not hermitian. The operator 
D 2 X , however, is hermitian. 

The hermitian character of an operator depends not only on the operator 
itself, but also on the functions on which it acts and on the range of integration. 
An operator may be hermitian with respect to one set of functions, but not with 
respect to another set. It may be hermitian with respect to a set of functions 
defined over one range of variables, but not with respect to the same set over a 
different range. For example, the hermiticity of the momentum operator p is 
dependent on the vanishing of the functions xpi at infinity. 

The product of two hermitian operators may or may not be hermitian. 
Consider the product AB where A and B are separately hermitian with respect 
to a set of functions xp t , so that 

(ipj | ABxpi) = (A-ipj | Bxpi) = (.BAxpj \ xp t ) (3.11) 

where we have assumed that the functions Atpj and Bip, also lie in the hermitian 
domain of A and B. The product AB is hermitian if, and only if, A and B 
commute. Using the same procedure, one can easily demonstrate that if A and 
B do not commute, then the operators (AB + BA) and i [A, B ] are hermitian. 

By setting B equal to A in the product AB in equation (3.11), we see that the 
square of a hermitian operator is hermitian. This result can be generalized to 
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any integral power of A. Since \Aip\ 2 is always positive, the integral (Arp \ Arp) 
is positive and consequently 

(ip | A 2 ip) 5 = 0 (3.12) 


Eigenvalues 

The eigenvalues of a hermitian operator are real. To prove this statement, we 
consider the eigenvalue equation 

Aip = aip (3.13) 

where A is hermitian, rp is an eigenfunction of A, and a is the corresponding 
eigenvalue. Multiplying by ip* and integrating give 

(ip\Aip) = a(ip\ip) (3.14) 

Multiplication of the complex conjugate of equation (3.13) by ip and integrat¬ 
ing give 

(Aip\ip) = a*(ip\ip) (3.15) 

Because A is hermitian, the left-hand sides of equations (3.14) and (3.15) are 
equal, so that 

(a — a*)(ip | ip) = 0 (3.16) 

Since the integral in equation (3.16) is not equal to zero, we conclude that 
a = a* and thus a is real. 


Orthogonality theorem 

If rp i and rp 2 are eigenfunctions of a hermitian operator A with different 
eigenvalues cq and a 2 , then rp\ and rp 2 are orthogonal. To prove this theorem, 
we begin with the integral 

(ip 2 \Aipx) = a x (ip 2 | tpi) (3.17) 

Since A is hermitian and a 2 is real, the left-hand side may be written as 

(xp 2 \Axp\) = (Aip 2 \ipi) = a 2 (ip 2 \f\) 

Thus, equation (3.17) becomes 

(a 2 - ai){ip 2 \ipi) =0 

Since cq f a 2 , the functions rp\ and rp 2 are orthogonal. 

Since the Dirac notation suppresses the variables involved in the integration, 
we re-express the orthogonality relation in integral notation 

q 2 , .. .)tpi(qi, qi, ■ ■ -)Mqi, qi, ■ ■ ■) dqr dq 2 ... = 0 

This expression serves as a reminder that, in general, the eigenfunctions of a 
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hermitian operator involve several variables and that the weighting function 
must be used. The functions are, therefore, orthogonal with respect to the 
weighting function w{q\, q 2 , ■ ■ .). 

If the weighting function is real and positive, then we can define f\ and 0 2 
as 

01 = \/wXpl, 02 = \fw\p2 

The functions 0i and 0 2 are then mutually orthogonal with respect to a 
weighting function of unity. Moreover, if the operator A is hermitian with 
respect to 0i and 0 2 with a weighting function w, then A is hermitian with 
respect to 0i and 0 2 with a weighting function equal to unity. 

If two or more linearly independent eigenfunctions have the same eigen¬ 
value, so that the eigenvalue is degenerate, the orthogonality theorem does not 
apply. However, it is possible to construct eigenfunctions that are mutually 
orthogonal. Suppose there are two independent eigenfunctions 0i and 0 2 of 
the operator A with the same eigenvalue a. Any linear combination 
C\ip\ T" c:y0 2 , where ci and c 2 are any pair of complex numbers, is also an 
eigenfunction of A with the same eigenvalue, so that 

A(c i 0 i + c 2 ip 2 ) = c\Axp\ + c 2 Aip 2 = a(c\ip\ + c 2 ip 2 ) 

From any pair ip\, ip 2 which initially are not orthogonal, we can construct by 
selecting appropriate values for ci and c 2 a new pair which are orthogonal. By 
selecting different sets of values for c\, c 2 , we may obtain infinitely many new 
pairs of eigenfunctions which are mutually orthogonal. 

As an illustration, suppose the members of a set of functions 0i ,ip 2 , , ip n 
are not orthogonal. We define a new set of functions 0 1 , 0 2 , - - -, 0„ by the 
relations 


01 = 01 

02 = «01 +02 

03 = hi 01 + b 2 <p 2 + 03 


If we require that 0 2 be orthogonal to 0i by setting (0i | 0 2 ) = 0, then the 
constant a is given by 

a = -(01 | 0 2 )/(01 I 0i) = -(01 I 02>/(01 | 01) 

and 0 2 is determined. We next require 03 to be orthogonal to 0i and to 0 2 , 
which gives 
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In general, we have 


b\ = -<0i l^3>/(01 |0l) 
b 2 = -(02 | 03)/(02 | 02) 


j-1 

0.v 0j “F ^ ) k s! (pi 
1=1 


^,S7 — (01 I 0s:)/ (01 I 0l) 

This construction is known as the Schmidt orthogonalization procedure. Since 
the initial selection for (p\ can be any of the original functions >p, or any linear 
combination of them, an infinite number of orthogonal sets 0, can be obtained 
by the Schmidt procedure. 

We conclude that all eigenfunctions of a hermitian operator are either 
mutually orthogonal or, if belonging to a degenerate eigenvalue, can be chosen 
to be mutually orthogonal. Throughout the remainder of this book, we treat all 
the eigenfunctions of a hermitian operator as an orthogonal set. 


Extended orthogonality theorem 

The orthogonality theorem can also be extended to cover a somewhat more 
general form of the eigenvalue equation. For the sake of convenience, we 
present in detail the case of a single variable, although the treatment can be 
generalized to any number of variables. Suppose that instead of the eigenvalue 
equation (3.5), we have for a hermitian operator A of one variable 

Aipi{x) = ajw(x)ipj(x) (3.18) 


where the function w(x) is real, positive, and the same for all values of i. 
Therefore, equation (3.18) can also be written as 

A* ip* (x) = a* w(x)ip* (x) (3.19) 

Multiplication of equation (3.18) by \p*(x) and integration over x give 


rp* (x)Arp t(x) dx = cq 


ip*{x)ipi{x)w{x) dx 


(3.20) 


Now, the operator A is hermitian with respect to the functions y>, with a 
weighting function equaling unity, so that the integral on the left-hand side of 
equation (3.20) becomes 


ip* (x)Arpj(x) dx 


xp i(x)A* ip* (x) dx = a* 


xp*(x)xpi(x)w(x) dx 


where equation (3.19) has been used as well. Accordingly, equation (3.20) 
becomes 
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(at 



f*(x)fi(x)w(x) dx 


0 


(3.21) 


When j = i, the integral in equation (3.21) cannot vanish because the 
product ipftpi and the function w(x) are always positive. Therefore, we have 
a, = a* and the eigenvalues a, are real. For the situation where i f j and 
a, f a*, the integral in equation (3.21) must vanish, 

rp* (x)rp i(x)w(x) dx = 0 (3.22) 


Thus, the set of functions ipfx) for non-degenerate eigenvalues are mutually 
orthogonal when integrated with a weighting function w(x). Eigenfunctions 
corresponding to degenerate eigenvalues can be made orthogonal as discussed 
earlier. 

The discussion above may be generalized to more than one variable. In the 
general case, equation (3.18) is replaced by 

Axpfqu q 2 , ...) = a.jw(qi, q 2 , ■ .. )ipfq \, qi, ■ ■ ■) (3.23) 

and equation (3.22) by 

ip*(qu q 2 , .. • )*pi(q\ , qi, ■ ■ -)Mqi, qi, ■ ■ •) d^i dq 2 ... = 0 (3.24) 


Equation (3.18) can also be transformed into the more usual form, equation 
(3.5). We first define a set of functions (p,(x) as 

0.(x) = [w(x)] l/2 rpi(x) = ip i(x) / u(x) (3.25) 

where 

u(x) = [w(x)]^ 1/2 (3.26) 

The function u(x) is real because w(x) is always positive and u(x) is positive 
because we take the positive square root. If w(x) approaches infinity at any 
point within the range of hermiticity of A (as x approaches infinity, for 
example), then xpfx) must approach zero such that the ratio (p,(x) approaches 
zero. Equation (3.18) is now multiplied by u(x) and tpfx) is replaced by 
u(x)(f) i(x) 

u(x)Au(x)(p ,(x) = a i w(x) [ u(x)] 2 0 fx) 

If we define an operator B by the relation B = u(x)A u(x) and apply equation 
(3.26), we obtain 

B(pi(x) = a i 0 / (x) 

which has the form of equation (3.5). We observe that 
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ip*Axpj dx 


( p*uAucpj dx 


0*50, dx 


(Axpjfipi dx 


( Au(pj)*u(pi dx 


(50/0, dx 


Since A is hermitian with respect to the 0,-s, the two integrals on the left of 
each equation equal each other, from which it follows that 


0*50, dx 


(50/0,- dx 


and 5 is therefore hermitian with respect to the 0,-s. 


3.4 Eigenfunction expansions 

Consider a set of orthonormal eigenfunctions 0,- of a hermitian operator. Any 
arbitrary function / of the same variables as 0, defined over the same range of 
these variables may be expanded in terms of the members of set 0, 


f=Yl a ^ Ji 

i 


(3.27) 


where the a,s are constants. The summation in equation (3.27) converges to the 
function / if the set of eigenfunctions is complete. By complete we mean that 
no other function g exists with the property that (g \ 0,) = 0 for any value of i, 
where g and 0, are functions of the same variables and are defined over the 
same variable range. As a general rule, the eigenfunctions of a hermitian 
operator are not only orthogonal, but are also complete. A mathematical 
criterion for completeness is presented at the end of this section. 

The coefficients a,- are evaluated by multiplying (3.27) by the complex 
conjugate 0* of one of the eigenfunctions, integrating over the range of the 
variables, and noting that the 0,-s are orthonormal 


(VV I /> 




^' 0 / 


^fl,(0y|0,) = dj 


Replacing the dummy index j by i, we have 

at = {Vi I /> 

Substitution of equation (3.28) back into (3.27) gives 


f = J2^’\f)^i 

i 


(3.28) 


(3.29) 



76 


General principles of quantum theory 


Completeness 

We now evaluate (f \ f) in which / and f* are expanded as in equation (3.27), 
with the two independent summations given different dummy indices 

(/1/) = (y a jVj Y= YY a l "/('/’/1'/-’<•) = kh-1 2 

Without loss of generality we may assume that the function / is normalized, so 
that (/1 /) = 1 and 


Y^ |c/| 2 = 1 (3.30) 


Equation (3.30) may be used as a criterion for completeness. If an eigenfunc¬ 
tion ?/>„ with a non-vanishing coefficient a„ were missing from the summation 
in equation (3.27), then the series would still converge, but it would be 
incomplete and would therefore not converge to /. The corresponding coeffi¬ 
cient a n would be missing from the left-hand side of equation (3.30). Since 
each term in the summation in equation (3.30) is positive, the sum without a n 
would be less than unity. Only if the expansion set ip, in equation (3.27) is 
complete will (3.30) be satisfied. 

The completeness criterion can also be expressed in another form. For this 
purpose we need to introduce the variables explicitly. For simplicity we assume 
first that / is a function of only one variable x. In this case, equation (3.29) is 


/(*) = Y 


(*')/(*') dx' 


ipfx) 


where x' is the dummy variable of integration. Interchanging the order of 
summation and integration gives 


f(x) 


Y v* i x 'ff( x ) 


fix') dx' 


Thus, the summation is equal to the Dirac delta function (see Appendix C) 


Y^ (x')ipi(x) = d(x — x') (3.31) 

i 

This expression, known as the completeness relation and sometimes as the 
closure relation, is valid only if the set of eigenfunctions is complete, and may 
be used as a mathematical test for completeness. Notice that the completeness 
relation (3.31) is not related to the choice of the arbitrary function /, whereas 
the criterion (3.30) is related. 

The completeness relation for the multi-variable case is slightly more 
complex. When expressed explicitly in terms of its variables, equation (3.29) is 
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f(qu 02, •■■) = 5 ^ 


tyfiqi, 02, • • 0/(01, 02, • • -M01, 02, • • 0 d01, d<72, • • • 


x -1/0(01, 02, • • 0 

Interchanging the order of summation and integration gives 


/(0i> 02, •••) = 


'Yi'P* ( 01 , 02 , • - 0 / 1 ( 01 , 02 , •••) 


X /(/, <72, ■ • Ow(01, 02, • • 0 d 01 d</2 . . . 
so that the completeness relation takes the form 

W(0i, 02, ■ • 0 X] ^f(01’ 02, • • OVh(01, 02, • • 0 = <5(01 - 0l)<5(02 - 02) • ■ • 

i 

(3.32) 


3.5 Simultaneous eigenfunctions 

Suppose the members of a complete set of functions 1/0 are simultaneously 
eigenfunctions of two hermitian operators d and B with eigenvalues a, and //, 
respectively 

dr/q = ayp, 

Bxpi = fiiXpi 

If we operate on the first eigenvalue equation with B and on the second with d, 
we obtain 

BAipj = a.jBipj = ajfijipj 
ABxpi = ffArpj = aftiipi 

from which it follows that 

(AB - BA yip, = [A, B]\p, = 0 

Thus, the functions rp, are eigenfunctions of the commutator [A, B ] with 
eigenvalues equal to zero. An operator that gives zero when applied to any 
member of a complete set of functions is itself zero, so that A and B commute. 
We have just shown that if the operators A and B have a complete set of 
simultaneous eigenfunctions, then A and B commute. 

We now prove the converse, namely, that eigenfunctions of commuting 
operators can always be constructed to be simultaneous eigenfunctions. 
Suppose that Aip t = a,ip, and that [A, B] = 0. Since A and B commute, we 
have 
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ABipj = BAipj = B(a.jipj ) = atBxpi 

Therefore, the function Bip, is an eigenfunction of A with eigenvalue a,-. 

There are now two possibilities; the eigenvalue a, of A is either non¬ 
degenerate or degenerate. If «, is non-degenerate, then it corresponds to only 
one independent eigenfunction ip u so that the function Bip, is proportional to 

fi 

Bxpi = fftpi 

where is the proportionality constant and therefore the eigenvalue of B 
corresponding to ip,. Thus, the function ip , is a simultaneous eigenfunction of 
both A and B. 

On the other hand, suppose the eigenvalue a, is degenerate. For simplicity, 
we consider the case of a doubly degenerate eigenvalue ap, the extension to n- 
fold degeneracy is straightforward. The function ip, is then any linear combina¬ 
tion of two linearly independent, orthonormal eigenfunctions ip a and ipa of A 
corresponding to the eigenvalue a, 

fpi = Clip n + c 2 ipa 

We need to determine the coefficients c\, c 2 such that Bip , = ffrpi, that is 
cyBxpn + c 2 Bxpi 2 = Pi(ciipn + c 2 ip i2 ) 

If we take the scalar product of this equation first with ip a and then with ip i2 , 
we obtain 

c\(B\\ — fij) + c 2 B[ 2 = 0 
c\B 2 \ + c 2 (B 22 — fii) = 0 
where we have introduced the simplified notation 

Bjk = {'ipij | Bip ik ) 

These simultaneous linear homogeneous equations determine c\ and c 2 and 
have a non-trivial solution if the determinant of the coefficients of c\, c 2 
vanishes 

B\\ — fii B n _ 

B 2 \ B 22 — Pi 

or 

P] ~ (B\\ + B 22 )Pi + B\\B 22 — B\ 2 B 2 \ = 0 

This quadratic equation has two roots P) ] and pf\ which lead to two 
corresponding sets of constants c\ l) , cy 1 and cf\ cp . Thus, there are two 
distinct functions ip\ l) and ipj 
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ip\ l) = c[ l) lp n + cfxpa 
Wi ~ c i Til ~r C 2 Til 

which satisfy the relations 

Biff = ySfVf 

and are, therefore, simultaneous eigenfunctions of the commuting operators A 
and B. 

This analysis can be extended to three or more operators. If three operators 
A, B, and C have a complete set of simultaneous eigenfunctions, then the 
argument above shows that A and B commute, B and C commute, and A and C 
commute. Furthermore, the converse is also true. If A commutes with both B 
and C, and B commutes with C, then the three operators possess simultaneous 
eigenfunctions. To show this, suppose that the three operators commute with 
one another. We know that since A and B commute, they possess simultaneous 
eigenfunctions \j) l such that 

Aipi = a i \p l 
Bipi = fiiipi 

We next operate on each of these expressions with C, giving 
CAtpi = A(Cijjj) = ClfLiipi) = afCipi) 

CBxpi = B(Cxpi) = C{fiiXpi) = ffiCfi) 

Thus, the function Cip, is an eigenfunction of both A and B with eigenvalues a, 
and fit, respectively. If a, and ji, are non-degenerate, then there is only one 
eigenfunction ip , corresponding to them and the function Cipj is proportional 
to xpi 

Cipi = fiXpi 

and, consequently, A, B, and C possess simultaneous eigenfunctions. For 
degenerate eigenvalues a,- and/or [f, simultaneous eigenfunctions may be 
constructed using a procedure parallel to the one described above for the 
doubly degenerate two-operator case. 

We note here that if A commutes with Ii and B commutes with C, but A does 
not commute with C, then A and B possess simultaneous eigenfunctions, B and 
C possess simultaneous eigenfunctions, but A and C do not. The set of 
simultaneous eigenfunctions of A and B will differ from the set for B and C. 
An example of this situation is discussed in Chapter 5. 



80 


General principles of quantum theory 


In some of the derivations presented in this section, operators need not be 
hermitian. However, we are only interested in the properties of hermitian 
operators because quantum mechanics requires them. Therefore, we have 
implicitly assumed that all the operators are hermitian and we have not 
bothered to comment on the parts where hermiticity is not required. 


3.6 Hilbert space and Dirac notation 

This section introduces the basic mathematics of linear vector spaces as an 
alternative conceptual scheme for quantum-mechanical wave functions. The 
concept of vector spaces was developed before quantum mechanics, but Dirac 
applied it to wave functions and introduced a particularly useful and widely 
accepted notation. Much of the literature on quantum mechanics uses Dirac’s 
ideas and notation. 

A set of complete orthonormal functions ipfx) of a single variable x may be 
regarded as the basis vectors of a linear vector space of either finite or infinite 
dimensions, depending on whether the complete set contains a finite or infinite 
number of members. The situation is analogous to three-dimensional cartesian 
space formed by three orthogonal unit vectors. In quantum mechanics we 
usually (see Section 7.2 for an exception) encounter complete sets with an 
infinite number of members and, therefore, are usually concerned with linear 
vector spaces of infinite dimensionality. Such a linear vector space is called a 
Hilbert space. The functions ffx) used as the basis vectors may constitute a 
discrete set or a continuous set. While a vector space composed of a discrete 
set of basis vectors is easier to visualize (even if the space is of infinite 
dimensionality) than one composed of a continuous set, there is no mathema¬ 
tical reason to exclude continuous basis vectors from the concept of Hilbert 
space. In Dirac notation, the basis vectors in Hilbert space are called ket 
vectors or just kets and are represented by the symbol |?/>,) or sometimes 
simply by | i). These ket vectors determine a ket space. 

When a ket | xp ( ) is multiplied by a constant c, the result c \xp t ) = \cxpt) is a 
ket in the same direction as |t/q); only the magnitude of the ket vector is 
changed. However, when an operator A acts on a ket |?/>,), the result is another 
ket | (pi) 

10/) = A 0/) = MV/) 

In general, the ket |0,) is not in the same direction as |'*/>,) nor in the same 
direction as any other ket | tpj), but rather has projections along several or all 
basis kets. If an operator A acts on all kets \y>,) of the basis set, and the 
resulting set of kets |0,) = \Aip-) are orthonormal, then the net result of the 
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operation is a rotation of the basis set | \p t ) about the origin to a new basis set 
|0,). In the situation where A acting on |'0,) gives a constant times 10,) (cf. 
equation (3.5)) 

Al'H’i) = M0/) = «/|0/) 

the ket \Atpj) is along the direction of |0,) and the kets 10,) are said to be 
eigenkets of the operator A. 

Although the expressions A\tpj) and | Aipj) are completely equivalent, there 
is a subtle distinction between them. The first, A\xpi), indicates the operator A 
being applied to the ket |0,j. The quantity | Aipj) is the ket which results from 
that application. 


Bra vectors 

The functions 0,(x) are, in general, complex functions. As a consequence, ket 
space is a complex vector space, making it mathematically necessary to 
introduce a corresponding set of vectors which are the adjoints of the ket 
vectors. The adjoint (sometimes also called the complex conjugate transpose) 
of a complex vector is the generalization of the complex conjugate of a 
complex number. In Dirac notation these adjoint vectors are called bra vectors 
or bras and are denoted by jp,\ or ( i \. Thus, the bra {'0,j is the adjoint [ip,y of 
the ket \xpi) and, conversely, the ket |'0,) is the adjoint (?/’,! T of the bra {xp ,j 

10/) f = (0/1 


(0/| f = |0<) 


These bra vectors determine a bra space, just as the kets determine ket space. 

The scalar product or inner product of a bra (01 and a ket \xp) is written in 
Dirac notation as (0|0) and is defined as 


( 010 ) 


0*(x)0(x) dx 


The bracket ( bra-c-ket) in (0['*/>) provides the names for the component 
vectors. This notation was introduced in Section 3.2 as a shorthand for the 
scalar product integral. The scalar product of a ket \xp) with its corresponding 
bra (xp\ gives a real, positive number and is the analog of multiplying a 
complex number by its complex conjugate. The scalar product of a bra (r/j j 
and the ket | Axp { ) is expressed in Dirac notation as (xpj\A\xpi) or as (j\A\i). 
These scalar products are also known as the matrix elements of A and are 
sometimes denoted by A ,y. 

To every ket in ket space, there corresponds a bra in bra space. For the ket 
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c\ipi), the corresponding bra is c*(0,j. We can also write c|0 ; -) as |c0 r ), in 
which case the corresponding bra is (c0,j, so that 

{cxpi\ = c*(rpi\ 

For every linear operator A that transforms |0,} in ket space into |0,) = | Aipj), 
there is a corresponding linear operator A* in bra space which transforms (0,j 
into (0/| = (Atpi\. This operator A' is called the adjoint of A. In bra space the 
transformation is expressed as 

{Atpil = {ipi\A ] 

Thus, for bras the operator acts on the vector to its left, whereas for kets the 
operator acts on the vector to its right. 

To find the relationship between A and its adjoint A\ we take the scalar 


product of {Aipj\ and | ipj) 


(Axpj\xpi) = (0/1^10/) 

(3.33a) 

or in integral notation 


(Af>jf 0 / dx = 0 A^ipi dx 

(3.33b) 


A comparison with equation (3.8) shows that if A is hermitian, then we have 
A' = A and A is said to be self-adjoint. The two terms, hermitian and self- 
adjoint, are synonymous. To find the adjoint of a non-hermitian operator, we 
apply equations (3.33). For example, we see from equation (3.10) that the 
adjoint of the operator d/dx is —d/dx. 

Since the scalar product (0|0) is equal to (0 |0)*, we see that 

(Afjlfi) = {fi\A\fj)* (3.34) 

Combining equations (3.33a) and (3.34) gives 

(0y|i t |0 i ) = (rpi\A\fj)* (3.35) 

If we replace A in equation (3.35) by the operator A\ we obtain 

{fj\U ] ) ] \fi) = (0 ! -M t |0/)* (3.36) 

where (A^f is the adjoint of the operator A. Equation (3.35) may be rewritten 
as 

(0 i |i t |0 7 >* = (0;|i|0;) 

and when compared with (3.36), we see that 

(0 i |(i t ) t |0 I ) = (0,'|i|0/) 

We conclude that 

(i f ) f =i 

From equation (3.35) we can also show that 


(3.37) 
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(cA) j = c*A f (3.38) 

where c is any complex constant, and that 

(A + B) f = A f + B f (3.39) 

To obtain the adjoint of the product AB of two operators, we apply equation 
(3.33a), first to AB, then to A, and finally to B 

{ipj\{AB) ] \xpi) = (ABxpj\xpi) = {Btpj\A ] \tpi) = {xp j\B ] A ] \xp j) 

Thus, we have the relation 

(AB) ] = B ] A ] (3.40) 

If A and B are hermitian (self-adjoint), then we have (AB)' = BA and further, if 

A and B commute, then the product AB is hermitian or self-adjoint. 

The outer product of a bra (0| and a ket \>p) is \tp)((p\ and behaves as an 
operator. If we let this outer product operate on another ket |%), we obtain the 
expression |0)(0|%), which can be regarded in two ways. The scalar product 
(0|%) is a complex number multiplying the ket |?/>), so that the complete 
expression is a ket parallel to | ip). Alternatively, the operator |0)(0| acts on the 
ket |%) and transfroms \%) into a ket proportional to \ip). 

To find the adjoint of the outer product |%)(0| of the ket \%) and the bra (01, 
we let A in equation (3.35) be equal to |%)(0| and obtain 

(0;l(b'}(0l) t l'0*> = (Vfi|(bfX 0 l)IVv}* = (0/k>*(0l07>* 

= (Zl0<)(0il0) = (0;l0)Cd0;) = {Vj\(\<P)(x\)\Vi) 

Setting equal the operators in the left-most and right-most integrals, we find 
that 

(MM = \<P)(JC\ (3.41) 


Projection operator 

We define the operator P, as the outer product of [ip,) and its corresponding bra 

Pi = |0/)(0i| = |*)( l 1 (3.42) 

and apply P, to an arbitrary ket |0) 

H 4 >) = I ? 'X ? 10 > 

Thus, the result of P t acting on |0) is a ket proportional to | /), the proportion¬ 
ality constant being the scalar product (0/|0). The operator P,, then, projects 
|0) onto | ipi) and for that reason is known as the projection operator. The 
operator Pf is given by 

P ■ = PiPi = |i)(i|/)(/| = \i){i\ = Pi 

where we have noted that the kets | i) are normalized. Likewise, the operator P" 
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for n>2 also equals P,. This property is consistent with the interpretation of 
Pi as a projection operator since the result of projecting |0) onto \i) should be 
the same whether the projection is carried out once, twice, or multiple times. 
The operator P t is hermitian, so that the projection of |0) on \y>,) is equal to 
the projection of |0,) on |0). To show that P, is hermitian, we let \%) = 
|0) = | i) in equation (3.41) and obtain P\ = P t . 

The expansion of a function f(x) in terms of the orthonormal set ipfx), as 
shown in equation (3.27), may be expressed in terms of kets as 

i/) = ^2 a i\Vi) = ^2 a M) 

i i 

where |/) is regarded as a vector in ket space. The constants a t are the 
projections of |/) on the ‘unit ket vectors’ \i) and are given by equation (3.28) 

a, = ( i\f) 

Combining these two equations gives equation (3.29), which when expressed 
in Dirac notation is 


l/> = £lOM/> 

i 

Since f(x) is an arbitrary function of x, the operator )T,jzj{/| must equal the 
identity operator, so that 

5>'M = 1 (3-43) 

i 

From the definition of Pj in equation (3.42), we see that 


Since the operator ^,-|z) (zj equals unity, it may be inserted at any point in an 
equation. Accordingly, we insert it between the bra and the ket in the scalar 
product of |/) with itself 

</!/> = 



where we have assumed \ f) is normalized. This expression may be written as 

</l/> = £(/|i>(- 1 /> = £ l(i |/)| 2 = £ l «,| 2 = 1 

i i i 

Thus, the expression (3.43) is related to the completeness criterion (3.30) and 
is called, therefore, the completeness relation. 
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3.7 Postulates of quantum mechanics 

In this section we state the postulates of quantum mechanics in terms of the 
properties of linear operators. By way of an introduction to quantum theory, the 
basic principles have already been presented in Chapters 1 and 2. The purpose 
of that introduction is to provide a rationale for the quantum concepts by 
showing how the particle-wave duality leads to the postulate of a wave 
function based on the properties of a wave packet. Although this approach, 
based in part on historical development, helps to explain why certain quantum 
concepts were proposed, the basic principles of quantum mechanics cannot be 
obtained by any process of deduction. They must be stated as postulates to be 
accepted because the conclusions drawn from them agree with experiment 
without exception. 

We first state the postulates succinctly and then elaborate on each of them 
with particular regard to the mathematical properties of linear operators. The 
postulates are as follows. 


1. The state of a physical system is defined by a normalized function 'I* of the spatial 
coordinates and the time. This function contains all the information that exists on 
the state of the system. 

2. Every physical observable A is represented by a linear hermitian operator A. 

3. Every individual measurement of a physical observable A yields an eigenvalue of 
the corresponding operator A. The average value or expectation value (A) from a 
series of measurements of A for systems, each of which is in the exact same state 

is given by (A) == ( v E|^| l T}. 

4. If a measurement of a physical observable A for a system in state T gives the 
eigenvalue A„ of A, then the state of the system immediately after the measurement 
is the eigenfunction (if A„ is non-degenerate) or a linear combination of eigenfunc¬ 
tions (if A„ is degenerate) corresponding to A„. 

5. The time dependence of the state function T is determined by the time-dependent 
Schrodinger differential equation 


i h 


dW 

dt 


= 


where H is the Hamiltonian operator for the system. 


This list of postulates is not complete in that two quantum concepts are not 
covered, spin and identical particles. In Section 1.7 we mentioned in passing 
that an electron has an intrinsic angular momentum called spin. Other particles 
also possess spin. The quantum-mechanical treatment of spin is postponed until 
Chapter 7. Moreover, the state function for a system of two or more identical 
and therefore indistinguishable particles requires special consideration and is 
discussed in Chapter 8. 
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State function 

According to the first postulate, the state of a physical system is completely 
described by a state function Tfiq, t) or ket j 1 ! 1 ), which depends on spatial 
coordinates q and the time t. This function is sometimes also called a state 
vector or a wave function. The coordinate vector q has components q\, q 2 ,..., 
so that the state function may also be written as v F(c/i, q 2 , ..., t). For a particle 
or system that moves in only one dimension (say along the x-axis), the vector q 
has only one component and the state vector T is a function of x and 
t: Tfix, t). For a particle or system in three dimensions, the components of q 
are x, y, z and W is a function of the position vector r and t : *P(r, t). The state 
function is single-valued, a continuous function of each of its variables, and 
square or quadratically integrable. 

For a one-dimensional system, the quantity l F * (x, t) x V{x, t) is the probabil¬ 
ity density for finding the system at position x at time t. In three dimensions, 
the quantity 'F* (r, /) x F(r, /) is the probability density for finding the system at 
point r at time t. For a multi-variable system, the product l F*(<yi, q 2 , 
..., tf¥(q i, q 2 , ..., t) is the probability density that the system has coordi¬ 
nates q\, q 2 , ... at time t. We show below that this interpretation of T ,;, F 
follows from postulate 3. We usually assume that the state function is normal¬ 
ized 

^*(< 71 , qi, ■■■, t)W(q u q 2 , ..., t)w(q i, q 2 , ■ ■ -)dq\ dq 2 ... = 1 
or in Dirac notation 

(qr|rir) = i 

where the limits of integration are over all allowed values of qi, q 2 , ... 


Physical quantities or observables 

The second postulate states that a physical quantity or observable is represented 
in quantum mechanics by a hermitian operator. To every classically defined 
function A(r, p) of position and momentum there corresponds a quantum- 
mechanical linear hermitian operator A(r, (A/i)V). Thus, to obtain the quan¬ 
tum-mechanical operator, the momentum p in the classical function is replaced 
by the operator p 

h 

p = -V (3.44) 

i 

or, in terms of components 

* A d At d „ Ad 

Px = l dx’ Py = T <9y ’ Pz = ldl 
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For multi-particle systems with cartesian coordinates ri, r 2 , ..., the classical 
function A(r\, r 2 , ..., pi, P 2 , . ..) possesses the corresponding operator 
A( rj, r 2 , ..., (h/i)Vi, (h/ i)V 2 , ...) where V/. is the gradient with respect to 
r^. For non-cartesian coordinates, the construction of the quantum-mechanical 
operator A is more complex and is not presented here. 

The classical function A is an observable, meaning that it is a physically 
measurable property of the system. For example, for a one-particle system the 
Hamiltonian operator H corresponding to the classical Hamiltonian function 

H(r, p) = f + V (r) 

2 m 

where p 2 = p • p = p 2 x + p 2 + p 2 , is 

H = — y— V 2 + F (r) 

2 m 

The linear operator H is easily shown to be hermitian. 


Measurement of observable properties 

The third postulate relates to the measurement of observable properties. Every 
individual measurement of a physical observable A yields an eigenvalue a, of 
the operator A. The eigenvalues are given by 

A\i)=X i \i) (3.45) 

where fz) are the orthonormal eigenkets of A. Since A is hermitian, the 
eigenvalues are all real. It is essential for the theory that A is hermitian because 
any measured quantity must, of course, be a real number. If the spectrum of A 
is discrete, then the eigenvalues A,- are discrete and the measurements of A are 
quantized. If, on the other hand, the eigenfunctions |z) form a continuous, 
infinite set, then the eigenvalues A, are continuous and the measured values of 
A are not quantized. The set of eigenkets |z) of the dynamical operator A are 
assumed to be complete. In some cases it is possible to show explicitly that | i) 
forms a complete set, but in other cases we must assume that property. 

The expectation value or mean value (A) of the physical observable A at 
time t for a system in a normalized state l I ; is given by 

(A) = ( l P|i|'I') (3.46) 

If 'E is not normalized, then the appropriate expression is 

<W) 

Some examples of expectation values are as follows 
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(x) 

(Px) 


(W|x|W> 



< r > 

(P) 


(W\r\W) 



E=(H) 




+ m 



The expectation value (A) is not the result of a single measurement of the 
property A, but rather the average of a large number (in the limit, an infinite 
number) of measurements of A on systems, each of which is in the same state 
'If Each individual measurement yields one of the eigenvalues A,-, and (A) is 
then the average of the observed array of eigenvalues. For example, if the 
eigenvalue X\ is observed four times, the eigenvalue A 2 three times, the 
eigenvalue A 3 once, and no other eigenvalues are observed, then the expectation 
value (A) is given by 

_ 4A t + 3A 2 + A 3 
' ' “ 8 

In practice, many more than eight observations would be required to obtain a 
reliable value for (A). 

In general, the expectation value (A) of the observable A may be written for 
a discrete set of eigenfunctions as 


(A) = P ,A, (3.47) 

i 

where P, is the probability of obtaining the value A,-. If the state function V F for 
a system happens to coincide with one of the eigenstates \i), then only the 
eigenvalue A, would be observed each time a measurement of A is made and 
therefore the expectation value (A) would equal A; 

(A) = (i\A\i) = (z'|A,-|z) = A* 

It is important not to confuse the expectation value (A) with the time average 
of A for a single system. 

For an arbitrary state 'P at a fixed time t, the ket | X F) may be expanded in 
terms of the complete set of eigenkets of A. In order to make the following 
discussion clearer, we now introduce a slightly more complicated notation. 
Each eigenvalue A, will now be distinct, so that A, f Ay for i f j. We let g, be 



3.7 Postulates of quantum mechanics 


89 


the degeneracy of the eigenvalue A,- and let \ia), a = 1,2, , g t , be the 

orthonormal eigenkets of A. We assume that the subset of kets corresponding 
to each eigenvalue A; has been made orthogonal by the Schmidt procedure 
outlined in Section 3.3. 

If the eigenkets | ia) constitute a discrete set, we may expand the state vector 
I'P) as 

i^> = EE c ‘«i /a > ( 3 - 48 ) 

i a =1 

where the expansion coefficients c ia are 

Cia = {ia\W) (3.49) 

The expansion of the bra vector (^1 is, therefore, given by 

( X V\ = EE c %w\ (3-50) 

j /S=l 

where the dummy indices i and a have been replaced by j and [ J >. 

The expectation value of A is obtained by substituting equations (3.48) and 
(3.50) into (3.46) 

( A ) = E E E E c *(s c i«(M A \ ia ) = E E E E 4 ^^ i*«) 

j fs= 1 i a= 1 j p=l i a=l 

= EEi c '«i 2 ^' ( 3 - 51 ) 

i a= 1 

where we have noted that the kets \ ia) are orthonormal, so that 

0W a ) = bifiafS 

A comparison of equations (3.47) and (3.51) relates the probability P, to the 
expansion coefficients c ia 

^• = EM 2 = EIH^)| 2 (3.52) 

a= 1 a—l 

where equation (3.49) has also been introduced. For the case where a, is non¬ 
degenerate, the index a is not needed and equation (3.52) reduces to 

Pi = \c,\ 2 = \(i\V)\ 2 

For a continuous spectrum of eigenkets with non-degenerate eigenvalues, it 
is more convenient to write the eigenvalue equation (3.45) in the form 

A\X) =X\X) 

where X is now a continuous variable and |A) is the eigenfunction whose 
eigenvalue is A. The expansion of the state vector W becomes 
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|xp) = C (A)|A) dA 

where 

c(A) = (m 

and the expectation value of A takes the form 

(A)= |c(A)| 2 AdA (3.53) 

If d Px is the probability of obtaining a value of A between A and A + dA, then 
equation (3.47) is replaced by 

(A) = Ad P X 

and we see that 

d P x = |c(A)| 2 dA = |(A|'F)| 2 dA 
The probability d P x is often written in the form 

d Px = p(A) dA 

where p(A) is the probability density of obtaining the result A and is given by 

p(A) = |c(A)| 2 = |(A|^)| 2 

In terms of the probability density, equation (3.53) becomes 

(A) = Ap(A)dA (3.54) 

In some applications to physical systems, the eigenkets of A possess a 
partially discrete and a partially continuous spectrum, in which case equations 
(3.51) and (3.53) must be combined. 

The scalar product ( X P| X P) may be evaluated from equations (3.48) and 
(3.50) as 

(q/|W) c % c Mi°) =Y,Y. I c '«| 2 

j p=\ i a =1 i a=l 

i a— 1 i 

Since the state vector W is normalized, this expression gives 

E« = > 

i 

Thus, the sum of the probabilities Pj equals unity as it must from the definition 
of probability. For a continuous set of eigenkets, this relationship is replaced by 
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d P x = p(A) cU = l 

As an example, we consider a particle in a one-dimensional box as discussed 
in Section 2.5. Suppose that the state function Wfx) for this particle is time- 
independent and is given by 

T^x) = C sin 5 ^, 0 s£ x a 

where C is a constant which normalizes fF(x). The eigenfunctions | n) and 
eigenvalues E n of the Hamiltonian operator II are 


. . 2 . nJtx 

n) = \ - sin I- 

V a \ a 


n= 1,2, ... 


Obviously, the state function ^(x) is not an eigenfunction of H. Following the 
general procedure described above, we expand ^(x) in terms of the eigenfunc¬ 
tions | n). This expansion is the same as an expansion in a Fourier series, as 
described in Appendix B. As a shortcut we may use equations (A.39) and 
(A.40) to obtain the identity 

sin 5 0 = — (TO sin 0 — 5 sin 36 + sin 56) 

16 

so that the expansion of ^(x) is 


W(x) = — 10 sin — — 5 sin — 1 ] + sin — 1 


= ^ (10|1> - 5|3) + |5)) 

A measurement of the energy of a particle in state x V(x) yields one of three 
values and no other value. The values and their probabilities are 

h 2 „ 10 2 100 

l_ W’ 1 ~~ 10 2 + 5 2 + l 2 “126“ ' 

9h 2 5 2 

= q 2 ’ ^3=7^=0.198 

8ma z 126 


ft = - = °.oo 8 


The sum of the probabilities is unity, 

Pi + P 3 + P 5 = 0.794 + 0.198 + 0.008 = 1 

The interpretation that the quantity ^*(< 71 , q 2 , ..., t) x V(q\, q 2 , ..., t) is 
the probability density that the coordinates of the system at time t are 



92 


General principles of quantum theory 


qi, q 2 , ... may be shown by comparing equations (3.46) and (3.54) for A equal 
to the coordinate vector q 


(q) = <W|q|W) 


T P*(q) x P(q)q dr 


<q) 


p(q)q dr 


For these two expressions to be mutually consistent, we must have 

P(q) = W*(q)W(q) 

Thus, this interpretation of 'F* l F follows from postulate 3 and for this reason 
is not included in the statement of postulate 1. 


Collapse of the state function 

The measurement of a physical observable A gives one of the eigenvalues X n of 
the operator A. As stated by the fourth postulate, a consequence of this 
measurement is the sudden change in the state function of the system from its 
original form *F to an eigenfunction or linear combination of eigenfunctions of 
A corresponding to X n . 

At a fixed time t just before the measurement takes place, the ket 1^) may 
be expanded in terms of the eigenkets | ia) of A, as shown in equation (3.48). If 
the measurement gives a non-degenerate eigenvalue A„, then immediately after 
the measurement the system is in state | n). The state function V F is said to 
collapse to the function \ n). A second measurement of A on this same system, 
if taken immediately after the first, always yields the same result X n . If the 
eigenvalue X n is degenerate, then right after the measurement the state function 
is some linear combination of the eigenkets | na), a = 1, 2, ..., g n . A second, 
immediate measurement of A still yields X„ as the result. 

From postulates 4 and 5, we see that the state function W can change with 
time for two different reasons. A discontinuous change in 'F occurs when some 
property of the system is measured. The state of the system changes suddenly 
from q 1 to an eigenfunction or linear combination of eigenfunctions associated 
with the observed eigenvalue. An isolated system, on the other hand, undergoes 
a continuous change with time in accordance with the time-dependent Schro- 
dinger equation. 


Time evolution of the state function 

The fifth postulate stipulates that the time evolution of the state function V F is 
determined by the time-dependent Schrodinger equation 
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dw 

\h— = HW (3.55) 

at 

where H is the Hamiltonian operator of the system and, in general, changes 
with time. However, in this book we only consider systems for which the 
Hamiltonian operator is time-independent. To solve the time-dependent Schro- 
dinger equation, we express the state function 'Ifiq, t) as the product of two 
functions 

W(q, t) = rlJ(q)x(t) (3.56) 

where f(q) depends only on the spatial variables and %(t) depends only on the 
time. In Section 2.4 we discuss the procedure for separating the partial 
differential equation (3.55) into two differential equations, one involving only 
the spatial variables and the other only the time. The state function W(q, t) is 
then shown to be 

vd(q, t ) = if(q)e~ iEt/h (3.57) 

where E is the separation constant. Since it follows from equation (3.57) that 

|W(q, t)\ 2 = \m)\ 2 

the probability density is independent of the time t and *P(q, t) is a stationary 
state. 

The spatial differential equation, known as the time-independent Schrodin- 
ger equation, is 

Hxp(q) = Eip(q) 

Thus, the spatial function f(q) is actually a set of eigenfunctions y> n (q) of the 
Hamiltonian operator H with eigenvalues E n . The time-independent Schrodin- 
ger equation takes the form 

Hip„(q) = E n xp„{ q) (3.58) 

and the general solution of the time-dependent Schrodinger equation is 

W(q, t) = J2 c„fn(q)e-' ,E " t/h (3.59) 

n 

where c n are arbitrary complex constants. 

The appearance of the Hamiltonian operator in equation (3.55) as stipulated 
by postulate 5 gives that operator a special status in quantum mechanics. 
Knowledge of the eigenfunctions and eigenvalues of the Hamiltonian operator 
for a given system is sufficient to determine the stationary states of the system 
and the expectation values of any other dynamical variables. 

We next address the question as to whether equation (3.59) is actually the 
most general solution of the time-dependent Schrodinger equation. Are there 
other solutions that are not expressible in the form of equation (3.59)? To 
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answer that question, we assume that T ; (q, t) is any arbitrary solution of the 
parital differential equation (3.55). We suppose further that the set of functions 
q) which satisfy the eigenvalue equation (3.58) is complete. Then we can, 
in general, expand Wlq, t) in terms of the complete set ip n (q) and obtain 

(3- 6 °) 

n 

The coefficients a„(t) in the expansion are given by 

a n (t) = (</>»(q)|W(q, t)) (3.61) 

and are functions of the time t, but not of the coordinates q. We substitute the 
expansion (3.60) into the differential equation (3.55) to obtain 

E hda„(t) - v—■> hdaJt ) 

T dt ) + a "^ H ^«(q) = 7 + E n a nd) dn( q) = 0 (3.62) 

n L -I n L 

where we have also noted that the functions ip n ( q) are eigenfunctions of H in 
accordance with equation (3.58). We next multiply equation (3.62) by ip*(q), 
the complex conjugate of one of the eigenfunctions of the orthogonal set, and 
integrate over the spatial variables 

5^ + En a n ( t ) (V'*(q)|V'»(<l)) 

n L - 1 

_ >r [Mfl„(f) 1 _ hdakd) „ . A _ n 

^ i dt i dt 

n *- - 1 

Replacing the dummy index k by n, we obtain the result 

and) = c n e~ iEnt/h (3.63) 

where c n is a constant independent of both q and l. Substitution of equation 
(3.63) into (3.60) gives equation (3.59), showing that equation (3.59) is indeed 
the most general form for a solution of the time-dependent Schrodinger 
equation. All solutions may be expressed as the sum over stationary states. 

3.8 Parity operator 

The parity operator fl is defined by the relation 

riVXq) = ip(- q) (3.64) 

Thus, the parity operator reverses the sign of each cartesian coordinate. This 
operator is equivalent to an inversion of the coordinate system through the 
origin. In one and three dimensions, equation (3.64) takes the form 

n?/’(x) = f(—x), Thp(r) = flfd, y, z) = Thl'(—x, —y, —z) = ip(—r) 
The operator n 2 is equal to unity since 
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riV(q) = ri(rivXq)) = nv(-q) = VXq) 

Further, we see that 

n"?/)(q) = ?/)(q), n even 
= ip(— q), n odd 


or 


II" = 1, n even 
= FI, n odd 

The operator fl is linear and hermitian. In the one-dimensional case, the 
hermiticity of IT is demonstrated as follows 


(0 |n|v>) = 


<p*(x)tp(—x)dx = — 


(p*(—x')xp{x')dx' 


•OO 

xp(x')fl(/)*(x')dx'= (fl(/)\tp) 

— OO 

where x in the second integral is replaced by x' = —x to obtain the third 
integral. By applying the same procedure to each coordinate, we can show that 
n is hermitian with respect to multi-dimensional functions. 

The eigenvalues X of the parity operator n are given by 

n?/b(q) = ^Pa(m) (3.65) 

where ipx( q) are the corresponding eigenfunctions. If we apply n to both sides 
of equation (3.65), we obtain 

n 2 *Mq) = ^TlV^(q) = X 2 tpx( q) 

Since n 2 = 1, we see that X 2 = 1 and that the eigenvalues X, which must be 
real because IT is hermitian, are equal to either +1 or —1. To find the 
eigenfunctions rpi(q), we note that equation (3.65) now becomes 

<M-q) = ±<Mq) 

For X = 1, the eigenfunctions of fl are even functions of q, while for X = — 1, 
they are odd functions of q. An even function of q is said to be of even parity, 
while odd parity refers to an odd function of q. Thus, the eigenfunctions of TI 
are any well-behaved functions that are either of even or odd parity in their 
cartesian variables. 

We show next that the parity operator II commutes with the Hamiltonian 
operator H if the potential energy V(q) is an even function of q. The kinetic 
energy term in the Hamiltonian operator is given by 

h 2 2 h 2 (d 2 d 2 
2m 2m \dq\ dq\ 
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and is an even function of each q If the potential energy V (q) is also an even 
function of each q/ c , then we have H( q) = //(—q) and 

[H, IT]/(q) = tf(q)Il/(q) - n//(q)/(q) = H(q)f(-q) - H(-q)f(-q) = 0 

Since the function /(q) is arbitrary, the commutator of H and II vanishes. 
Thus, these operators have simultaneous eigenfunctions for systems with 
V(q) = V(-q). 

If the potential energy of a system is an even function of the coordinates and 
if q) is a solution of the time-independent Schrodinger equation, then the 
function ip(— q) is also a solution. When the eigenvalues of the Hamiltonian 
operator are non-degenerate, these two solutions are not independent of each 
other, but are proportional 

VX-q) = cf(q) 

These eigenfunctions are also eigenfunctions of the parity operator, leading to 
the conclusion that c = ± 1 . Consequently, some eigenfunctions will be of even 
parity while all the others will be of odd parity. 

For a degenerate energy eigenvalue, the several corresponding eigenfunc¬ 
tions of H may not initially have a definite parity. However, each eigenfunction 
may be written as the sum of an even part ip e (q) and an odd part ?/> 0 (q) 

VXq) = ^e(q) + V'o(q) 

where 

V'e(q) = \[f(q) + VX-q)] = ^e(-q) 

Xo(q) = KvXq) - VX-q)] = —Vo(—q) 

Since any linear combination of f(q) and v/X—q) satisfies Schrodinger’s equa¬ 
tion, the functions ip e (q) and ?/><,(q) are eigenfunctions of //. Furthermore, the 
functions f e ( q) and ip 0 {q) are also eigenfunctions of the parity operator FI, the 
first with eigenvalue +1 and the second with eigenvalue — 1. 


3.9 Hellmann-Feynman theorem 

A useful expression for evaluating expectation values is known as the Hell¬ 
mann-Feynman theorem. This theorem is based on the observation that the 
Hamiltonian operator for a system depends on at least one parameter X, which 
can be considered for mathematical purposes to be a continuous variable. For 
example, depending on the particular system, this parameter X may be the mass 
of an electron or a nucleus, the electronic charge, the nuclear charge parameter 
Z, a constant in the potential energy, a quantum number, or even Planck’s 
constant. The eigenfunctions and eigenvalues of H(X) also depend on this 
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parameter, so that the time-independent Schrodinger equation (3.58) may be 
written as 

H(l)il) n (X) = E n {XM n {X) (3.66) 

The expectation value of H(X) is, then 

E„(X) = (ip n (X)\H(X)\xp n (X)) (3.67) 

where we assume that ip n (X) is normalized 

(V»„(A)|V'„(A)) = 1 (3.68) 

To obtain the Hellmann-Feynman theorem, we differentiate equation (3.67) 
with respect to A 

^£«(A) = ^«(A) 



+ 


dA 


fnU) 


//(A) 


^«(A)) + ( V»(A) 


//(A) 


dA 


VTi(A) 


(3.69) 


Applying the hermitian property of H(X) to the third integral on the right-hand 
side of equation (3.69) and then applying (3.66) to the second and third terms, 
we obtain 


d 

dA 


E„(X) 


Va(A) 




fntt) 


+ E n (X) 




+ ( V'nCA) 


The derivative of equation (3.68) with respect to A is 




+ 






(3.70) 


showing that the last term on the right-hand side of (3.70) vanishes. We thereby 
obtain the Hellmann-Feynman theorem 


d 

dA 


E n (X) 





(3.71) 


3.10 Time dependence of the expectation value 

The expectation value (A) of the dynamical quantity or observable A is, in 
general, a function of the time t. To determine how (A) changes with time, we 
take the time derivative of equation (3.46) 
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\ / 


A 

V j + ( V 

A 



dA 

dt 




Equation (3.55) may be substituted for the time derivatives of the wave function 
to give 

/ /s v 

dA 


= \ + fa 


= 1 -{W\HA\W) + 


8t 

dA 

~di 






= ^?P\[H,A]\V) + (W 


dA 

dt 




= -([ET, A]) + 


dA 

dt 


where the hermiticity of H and the definition (equation (3.3)) of the commu¬ 
tator have been used. If the operator A is not an explicit function of time, then 
the last term on the right-hand side vanishes and we have 


d ( A ) _ 1 (rtf h\ 


(3.72) 


If we set A equal to unity, then the commutator [//, A] vanishes and equation 
(3.72) becomes 

d(J> 


dr 


= 0 


or 

2<v|i|v} = 2(ip|v) = o 

We thereby obtain the result in Section 2.2 that if W is normalized, it remains 
normalized as time progresses. 

If the operator A in equation (3.72) is set equal to H, then again the 
commutator vanishes and we have 

d (A) d (H) d E Q 

dr dr dr 

Thus, the energy E of the system, which is equal to the expectation value of the 
Hamiltonian, is conserved if the Hamiltonian does not depend explicitly on 
time. 

By setting the operator A in equation (3.72) equal first to the position 
variable x, then the variable y, and finally the variable z, we can show that 
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or, in vector notation 




which is one of the Ehrenfest theorems discussed in Section 2.3. The other 
Ehrenfest theorem, 

d(p) 


d t 


= — <VF(r)> 


may be obtained from equation (3.72) by setting A successively equal to p x 
p y , and p z . 


3.11 Heisenberg uncertainty principle 

We have shown in Section 3.5 that commuting hermitian operators have 
simultaneous eigenfunctions and, therefore, that the physical quantities asso¬ 
ciated with those operators can be observed simultaneously. On the other hand, 
if the hermitian operators A and B do not commute, then the physical 
observables A and B cannot both be precisely determined at the same time. We 
begin by demonstrating this conclusion. 

Suppose that A and B do not commute. Let a t and (>, be the eigenvalues of A 
and B, respectively, with corresponding eigenstates \a,) and |/?,•) 

A\ctLi) = a;|a,-) (3.73a) 

m)=m) (3.73b) 

Some or all of the eigenvalues may be degenerate, but each eigenfunction has a 
unique index i. Suppose further that the system is in state | aj), one of the 
eigenstates of A. If we measure the physical observable A, we obtain the result 
(i j. What happens if we simultaneously measure the physical observable B1 To 
answer this question we need to calculate the expectation value (B) for this 
system 

(B) = (aj\B\aj) (3.74) 

If we expand the state function |a 7 ) in terms of the complete, orthonormal set 

I Pt) 

I aj) = J2 c 'l W 

i 

where c, are the expansion coefficients, and substitute the expansion into 
equation (3.74), we obtain 
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<«) = EE = E E = e 

i k i k i 

where (3.73b) has been used. Thus, a measurement of B yields one of the many 
values ^ with a probability |c/| 2 . There is no way to predict which of the values 
ff will be obtained and, therefore, the observables A and B cannot both be 
determined concurrently. 

For a system in an arbitrary state 'F, neither of the physical observables A 
and B can be precisely determined simultaneously if A and B do not commute. 
Let A A and A B represent the width of the spread of values for A and B, 
respectively. We define the variance (AA) 2 by the relation 

(A A) 2 = ((A - (A)) 2 ) (3.75) 

that is, as the expectation value of the square of the deviation of A from its 
mean value. The positive square root A A is the standard deviation and is called 
the uncertainty in A. Noting that (A) is a real number, we can obtain an 
alternative expression for (Ad) 2 as follows: 

(A^) 2 = ((A - (A)) 2 ) = (A 2 - 2(A)A + (A) 2 ) 

= (A 2 ) - 2(A)(A) + (A) 2 = (A 2 ) - <d> 2 (3.76) 

Expressions analogous to equations (3.75) and (3.76) apply for (A B) 2 . 

Since A and B do not commute, we define the operator C by the relation 

[A, B] = AB - BA = iC (3.77) 

The operator C is hermitian as discussed in Section 3.3, so that its expectation 
value (C) is real. The commutator of A — (A) and B — {B) may be expanded 
as follows 

[i - (A), B - (B)] = (A - (A))(B - (. B » - (B - (B))(A - (A)) 

= AB - BA = iC (3.78) 

where the cross terms cancel since (A) and (B) are numbers and commute with 
the operators A and B. We use equation (3.78) later in this section. 

We now introduce the operator 

A - (A) + iA(B - (B)) 

where X is a real constant, and let this operator act on the state function 

[A - (A) + \l(B - (5))]W 

The scalar product of the resulting function with itself is, of course, always 
positive, so that 

{[A - (A) + il(B - (5»FP|[i - (A) + \l(B - {B))YV) > 0 (3.79) 

Expansion of this expression gives 
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((A - (A))Wj(A - (A))W) + X 2 ((B - (B))W\(B - (B))W) 

+ iX((A - (^)) X F|(5 - {B))W} - \X((B - {B)YV\(A - (^4))^) 5s 0 
or, since A and B are hermitian 
(W|(i - (H)) 2 ^) + 2 2 <W|(5 - <5» 2 |'F> 

+ U('P|[i-<H),5-(5)]|W) >0 


Applying equations (3.75) and (3.78), we have 

(A A) 2 + X 2 (AB) 2 -X(C) 3= 0 

If we complete the square of the terms involving X, we obtain 


(A A) 1 + (A BY X - 


<C) 


(Q 2 

4(A B) 2 


0 


2(A Bf 

Since X is arbitrary, we select its value so as to eliminate the second term 

i <o 

2(A B) 2 

thereby giving 

(AA) 2 (AB) 2 

or, upon taking the positive square root, 


(3.80) 


\{C? 


AAAB 3= f|(C)| 


Substituting equation (3.77) into this result yields 

AAAB ^ \\{[A, B])\ (3.81) 

This general expression relates the uncertainties in the simultaneous measure¬ 
ments of A and B to the commutator of the corresponding operators A and B 
and is a general statement of the Heisenberg uncertainty principle. 


Position-momentum uncertainty principle 

We now consider the special case for which A is the variable x (A = x) and B 
is the momentum p x (B = —i ii d/dx). The commutator [A, B] may be evaluated 
by letting it operate on W 


so that \ {[A, 5]) | =h and equation (3.81) gives 

A A ^ 

AxAp x 3= - 


= 


(3.82) 


The Heisenberg position-momentum uncertainty principle (3.82) agrees 
with equation (2.26), which was derived by a different, but mathematically 
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equivalent procedure. The relation (3.82) is consistent with (1.44), which is 
based on the Fourier transform properties of wave packets. The difference 
between the right-hand sides of (1.44) and (3.82) is due to the precise definition 
(3.75) of the uncertainties in equation (3.82). 

Similar applications of equation (3.81) using the position-momentum pairs 
y, p y and z, p- yield 

A A ^ A A ^ 

AyApy 3= -, AzAp z — 

Since x commutes with the operators p y and p z , y commutes with p x and p z , 
and z commutes with p x and p y , the relation (3.81) gives 

AqiApj = 0, ifj 

where q\ = x, q 2 = y, qj = z, p { = p x , p 2 = p y , /q = p z . Thus, the position 
coordinate q, and the momentum component pj for i f j may be precisely 
determined simultaneously. 


Minimum uncertainty wave packet 

The minimum value of the product AAAB occurs for a particular state 'F for 
which the relation (3.81) becomes an equality, i.e., when 

AAAB = \\{[A,B])\ (3.83) 

According to equation (3.79), this equality applies when 

[i - {A) + \X(B - (B))]W = 0 (3.84) 

where X is given by (3.80). For the position-momentum example where A = x 
and B = —ihd/dx, equation (3.84) takes the form 

for which the solution is 

rp _ CQ -(x-{x)fl2lti Q i(p x )x/h (3.85) 

where c is a constant of integration and may be used to normalize 'F. The real 
constant X may be shown from equation (3.80) to be 

h _ 2(Ax) 2 
2(A p x ) 2 h 

where the relation AxA p x = h/2 has been used, and is observed to be positive. 

Thus, the state function ^ in equation (3.85) for a particle with minimum 

position-momentum uncertainty is a wave packet in the form of a plane wave 
exp[i(/) A .)x//i] with wave number k 0 = {p x )/fi multiplied by a gaussian 
modulating function centered at (x). Wave packets are discussed in Section 
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1.2. Only the spatial dependence of W has been derived in equation (3.85). The 
state function W may also depend on the time through the possible time 
dependence of the parameters c, X, (x), and (p x ). 


Energy-time uncertainty principle 

We now wish to derive the energy-time uncertainty principle, which is 
discussed in Section 1.5 and expressed in equation (1.45). We show in Section 
1.5 that for a wave packet associated with a free particle moving in the x- 
direction the product AEAt is equal to the product AxAp x if A E and At are 
defined appropriately. However, this derivation does not apply to a particle in a 
potential field. 

The position, momentum, and energy are all dynamical quantities and 
consequently possess quantum-mechanical operators from which expectation 
values at any given time may be determined. Time, on the other hand, has a 
unique role in non-relativistic quantum theory as an independent variable; 
dynamical quantities are functions of time. Thus, the ‘uncertainty’ in time 
cannot be related to a range of expectation values. 

To obtain the energy-time uncertainty principle for a particle in a time- 
independent potential field, we set A equal to // in equation (3.81) 

(AE)(AB) ^ i| ([H, B])\ 

where A E is the uncertainty in the energy as defined by (3.75) with A = H. 
Substitution of equation (3.72) into this expression gives 


(AE)(AB) > | 


d (B) 

d t 


(3.86) 


In a short period of time At, the change in the expectation value of B is given 
by 


A B 


d (B) 

d t 


At 


When this expression is combined with equation (3.86), we obtain the desired 
result 


(AE)(At) 3= | (3.87) 

We see that the energy and time obey an uncertainty relation when At is 
defined as the period of time required for the expectation value of B to change 
by one standard deviation. This definition depends on the choice of the 
dynamical variable B so that At is relatively larger or smaller depending on 
that choice. If d(B)/dt is small so that B changes slowly with time, then the 
period At will be long and the uncertainty in the energy will be small. 
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Conversely, if B changes rapidly with time, then the period A t for B to change 
by one standard deviation will be short and the uncertainty in the energy of the 
system will be large. 


Problems 

3.1 Which of the following operators are linear? 

(a) f (b) sin (c) xlf (d) D x x 

3.2 Demonstrate the validity of the relationships (3.4a) and (3.4b). 

3.3 Show that 

[A, [B, C]] + [B, [C, A]] + [C, [A, B]] = 0 
where A, B. and C are arbitrary linear operators. 

3.4 Show that ( D x + x)( D x — x) — b\ — x 2 — 1. 

3.5 Show that xer x is an eigenfunction of the linear operator (D 2 — 4x 2 ). What is 
the eigenvalue? 

3.6 Show that the operator Z) 2 is hermitian. Is the operator i D 2 hermitian? 

3.7 Show that if the linear operators A and B do not commute, the operators 
(AB + BA) and i [A, B] are hermitian. 

3.8 If the real normalized functions /(r) and g( r) are not orthogonal, show that their 
sum /(r) + g(r) and their difference /(r) — g(r) are orthogonal. 

3.9 Consider the set of functions xp\ — e~ x / 2 , ip 2 — xq~ x I 2 , xp^, — x 2 e~ x / 2 , xp 4 — 
x 3 e x / 2 , defined over the range 0 =£ x ^ 00 . Use the Schmidt orthogonalization 
procedure to construct from the set y, an orthogonal set of functions with 
w(x) — 1. 

3.10 Evaluate the following commutators: 

(a) [x, p x ] (b) [x, p 2 x ] (c) [x, H] (d) [p x , H] 

3.11 Evaluate [x, p\ ] and [x 2 , p 2 ] using equations (3.4). 

3.12 Using equation (3.4b), show by iteration that 

[x", j> x ] = i finx"- 1 

where n is a positive integer greater than zero. 

3.13 Show that 

r/Y . - , •> d/(x) 

lf(x), Px 1 = th —— 

dx 

3.14 Calculate the expectation values of x, x 2 , p, and p 2 for a particle in a one¬ 
dimensional box in state ip n (see Section 2.5). 

3.15 Calculate the expectation value of p 4 for a particle in a one-dimensional box in 
state f n . 

3.16 A hermitian operator A has only three normalized eigenfunctions y,’ 1 , i/-’ 2 , V ; 3 > 
with corresponding eigenvalues a\ = 1, aj—2, a-, — 3. respectively. For a 
particular state (j) of the system, there is a 50% chance that a measure of A 
produces a\ and equal chances for either 02 or c/ 3 . 
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(a) Calculate (A). 

(b) Express the nonnalized wave function (j> of the system in terms of the 
eigenfunctions of A. 

3.17 The wave function 'E( r) for a particle in a one-dimensional box of length a is 

W(x) = C sin 7 ^; 0 x a 

where C is a constant. What are the possible observed values for the energy and 
their respective probabilities? 

3.18 If )/.’} is an eigenfunction of H with eigenvalue E, show that for any operator A 
the expectation value of [H, A] vanishes, i.e., 

(ip\[H,A]\xp) =0 

3.19 Derive both of the Ehrenfest theorems using equation (3.72). 

3.20 Show that 

AH Ax 3= ^ (p x ) 

2m 



4 

Harmonic oscillator 


In this chapter we treat in detail the quantum behavior of the harmonic 
oscillator. This physical system serves as an excellent example for illustrating 
the basic principles of quantum mechanics that are presented in Chapter 3. The 
Schrodinger equation for the harmonic oscillator can be solved rigorously and 
exactly for the energy eigenvalues and eigenstates. The mathematical process 
for the solution is neither trivial, as is the case for the particle in a box, nor 
excessively complicated. Moreover, we have the opportunity to introduce the 
ladder operator technique for solving the eigenvalue problem. 

The harmonic oscillator is an important system in the study of physical 
phenomena in both classical and quantum mechanics. Classically, the harmonic 
oscillator describes the mechanical behavior of a spring and, by analogy, other 
phenomena such as the oscillations of charge flow in an electric circuit, the 
vibrations of sound-wave and light-wave generators, and oscillatory chemical 
reactions. The quantum-mechanical treatment of the harmonic oscillator may 
be applied to the vibrations of molecular bonds and has many other applica¬ 
tions in quantum physics and field theory. 


4.1 Classical treatment 

The harmonic oscillator is an idealized one-dimensional physical system in 
which a single particle of mass m is attracted to the origin by a force F 
proportional to the displacement of the particle from the origin 

F = -kx (4.1) 

The proportionality constant k is known as the force constant. The minus sign 
in equation (4.1) indicates that the force is in the opposite direction to the 
direction of the displacement. The typical experimental representation of the 
oscillator consists of a spring with one end stationary and with a mass m 
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attached to the other end. The spring is assumed to obey Hooke s law, that is to 
say, equation (4.1). The constant k is then often called the spring constant. 

In classical mechanics the particle obeys Newton’s second law of motion 


d 2 x 

F = ma = m —r 
d t 2 


(4.2) 


where a is the acceleration of the particle and t is the time. The combination of 
equations (4.1) and (4.2) gives the differential equation 

d 2 x k 

dt 2 m 


for which the solution is 


x = A sin(2jTvt + b) = A sinfiw + b) (4.3) 


where the amplitude A of the vibration and the phase b are the two constants 
of integration and where the frequency v and the angular frequency o> of 
vibration are related to k and m by 


a) = 2jtv = 


(4.4) 


V m 

According to equation (4.3), the particle oscillates sinusoidally about the origin 
with frequency v and maximum displacement ±A. 

The potential energy Fof a particle is related to the force F acting on it by 
the expression 


F = — 


dV 

dx 


Thus, from equations (4.1) and (4.4), we see that for a harmonic oscillator the 
potential energy is given by 

V = \kx 2 = jinarx 2 (4.5) 

The total energy E of the particle undergoing harmonic motion is given by 
E = \mv 2 + V = \mv 2 + ^marx 2 (4.6) 

where v is the instantaneous velocity. If the oscillator is undisturbed by outside 
forces, the energy E remains fixed at a constant value. When the particle is at 
maximum displacement from the origin so that x = ±A, the velocity v is zero 
and the potential energy is a maximum. As |xj decreases, the potential 
decreases and the velocity increases keeping E constant. As the particle crosses 
the origin (x = 0), the velocity attains its maximum value V = sJlEjm. 

To relate the maximum displacement A to the constant energy E, we note 
that when x = ±A, equation (4.6) becomes 

E = 4 met) 2 A 2 


so that 
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A 



Thus, equation (4.3) takes the form 


x 



sin(a>r + b ) 


By defining a reduced distance y as 


y = oj 



(4.7) 


(4.8) 


so that the particle oscillates between y = — 1 and y = 1, we may express the 
equation of motion (4.7) in a universal form that is independent of the total 
energy E 

y(t) = sin(ojt + b) (4.9) 


As the particle oscillates back and forth between y = — 1 and y = 1, the 
probability that it will be observed between some value y and y + dy is 
P(y) dy, where P(y) is the probability density. Since the probability of finding 
the particle within the range — 1 s£ y 1 is unity (the particle must be some¬ 
where in that range), the probability density is normalized 

•l 

P(y) dy = i 

j-i 

The probability of finding the particle within the interval dy at a given distance 
y is proportional to the time dr spent in that interval 

dr 

P(y) dy = cdr = c— dv 

dy 


so that 


P(y) = C T~ 

d y 

where c is the proportionality constant. To find P(y), we solve equation (4.9) 
for r 


r(y) = — [sin 1 (y) - b] 

CO 

and then take the derivative to give 

P(y) = -(1 - y 2 )" 1/2 = -(I - y 2 )- 1/2 (4.10) 

Ct) Jt 

where c was determined by the normalization requirement. The probability 
density P(y) for the oscillating particle is shown in Figure 4.1. 
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Figure 4.1 Classical probability density for an oscillating particle. 


4.2 Quantum treatment 

The classical Hamiltonian H(x, p) for the harmonic oscillator is 

2 2 

H(x, p) = ^-+ V(x) = + \ma) 2 x 2 (4.11) 

2m 2m 

The Hamiltonian operator H(x, p) is obtained by replacing the momentum p 
in equation (4.11) with the momentum operator p = —if? d/dx 

i .2 f /2 c \2 


Pt U 1 99 'I u , 99 

H = --h kmoj x = — -—— r + hnco x 

2 m 2m ax 1 


(4.12) 


The Schrodinger equation is, then 
h 2 d 2 tp(x) 


f \ma) 2 x 2 'ip(x ) = Erjj(x) 


2 m dx 2 ' 2 

It is convenient to introduce the dimensionless variable § by the definition 


(4.13) 


/ ma>\ l / 2 


so that the Hamiltonian operator becomes 

hto 


H ' ^ d § 2 


(4.14) 


(4.15) 


Since the Hamiltonian operator is written in terms of the variable § rather than 
x, we should express the eigenstates in terms of § as well. Accordingly, we 
define the functions 0(£) by the relation 


1/4 

0 ( 1 ) = ( — ) V(x) 


-) 

into J 


(4.16) 


If the functions ip(x) are normalized with respect to integration over x 
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•00 

|?/i(x)| 2 dx = 1 

— OO 

then from equations (4.14) and (4.16) we see that the functions 0(£) are 
normalized with respect to integration over § 

*oo 

| 0(£)| 2 d £ = 1 

J—OO 


The Schrodinger equation (4.13) then takes the form 


d 2 0(§) 






(4.17) 


Since the Hamiltonian operator is hermitian, the energy eigenvalues E are real. 

There are two procedures available for solving this differential equation. The 
older procedure is the Frobenius or series solution method. The solution of 
equation (4.17) by this method is presented in Appendix G. In this chapter we 
use the more modem ladder operator procedure. Both methods give exactly the 
same results. 


Ladder operators 

We now solve the Schrodinger eigenvalue equation for the harmonic oscillator 
by the so-called factoring method using ladder operators. We introduce the 
two ladder operators a and a J by the definitions 


a 


a = 


1/2 


mco\ 

2 h) 


ma>\ 1//2 

2/TJ 


x + 


x — 


1 p 


mco 

i P 
mco 


~ § + 


V2 


di 


V2^ dl. 


(4.18a) 


(4.18b) 


Application of equation (3.33) reveals that the operator fit is the adjoint of fi, 
which explains the notation. Since the operator a is not equal to its adjoint fit, 
neither fi nor fit is hermitian. (We follow here the common practice of using a 
lower case letter for the harmonic-oscillator ladder operators rather than our 
usual convention of using capital letters for operators. ) We readily observe that 


1 

aa' = - 
2 

1 

a'a = - 


? d£ 2 

5 d § 2 


H 1 

hco 2 


H 

hco 


1 

2 


from which it follows that the commutator of fi and fit is unity 

[fi, fit] = fifij — fitfi = 1 

We next define the number operator N as the product fitfi 


(4.19a) 


(4.19b) 


(4.20) 
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N = a 1 a (4.21) 

The adjoint of N may be obtained as follows 

N ^ = ( a^a )t = = cfid = N 

where the relations (3.40) and (3.37) have been used. We note that N is self- 
adjoint, making it hermitian and therefore having real eigenvalues. Equation 
(4.19b) may now be written in the form 

H = ha)(N + i) (4.22) 

Since H and N differ only by the factor fun and an additive constant, they 
commute and, therefore, have the same eigenfunctions. 

If the eigenvalues of N are represented by the parameter A and the 
corresponding orthonormal eigenfunctions by 4>u(£,) or, using Dirac notation, 
by | Xi), then we have 

N\Xi) = X\Xi) (4.23) 

and 

H\Xi) = fia)(N + ±)| Xi) = hto(X + i)| Xi) = E x \Xi) (4.24) 

Thus, the energy eigenvalues E x are related to the eigenvalues of N by 

E x = (X + \)h(D (4.25) 

The index i in | Xi) takes on integer values from 1 to g x , where g x is the 
degeneracy of the eigenvalue X. We shall find shortly that each eigenvalue of N 
is non-degenerate, but in arriving at a general solution of the eigenvalue 
equation, we must initially allow for degeneracy. 

From equations (4.20) and (4.21), we note that the product of N and either a 
or a t may be expressed as follows 

Na = ci^cia = (aa J — l)a = a(a^a — 1) = a(N — 1) (4.26a) 

No 1 = a^aa^ = a\a*a + 1) = cE{N + 1) (4.26b) 

These identities are useful in the following discussion. 

If we let the operator N act on the function a\Xi), we obtain 

Na\Xi) = a(N — 1)| Xi) = ci(X — l)|Az) = (X — l)a|Az) (4.27) 

where equations (4.23) and (4.26a) have been introduced. Thus, we see that 
a|Az) is an eigenfunction of N with eigenvalue A — 1. The operator a alters the 
eigenstate |Az) to an eigenstate of N corresponding to a lower value for the 
eigenvalue, namely A — 1. The energy of the oscillator is thereby reduced, 
according to (4.25), by hco. As a consequence, the operator a is called a 
lowering operator or a destruction operator. 

Letting N operate on the function cf\/.i) gives 
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Na ] \Xi) = a\N + l)|Ai) = a\X + l)|Ai) = (A + l)a f |A/> (4.28) 

where equations (4.23) and (4.26b) have been used. In this case we see that 
aX\Xi) is an eigenfunction of N with eigenvalue A + 1. The operator a' changes 
the eigenstate |Az) to an eigenstate of N with a higher value, A + 1, of the 
eigenvalue. The energy of the oscillator is increased by hto. Thus, the operator 
a' is called a raising operator or a creation operator. 


Quantization of the energy 

In the determination of the energy eigenvalues, we first show that the 
eigenvalues X of N are positive (A 3= 0). Since the expectation value of the 
operator N for an oscillator in state |Az) is A, we have 

(Az|IV|Az) = A(Az|Az) = A 

The integral (Az| /V|Az) may also be transformed in the following manner 


(Xi\N\Xi) = (Az|a^a|Az) 


(atjrfjCa^xj) dr 


\a(pn\ 2 dr 


The integral on the right must be positive, so that A is positive and the 
eigenvalues of N and H cannot be negative. 

For the condition A = 0, we have 

\a(p h \ 2 dv = 0 


which requires that 

a|(A = 0)z) = 0 (4.29) 

For eigenvalues A greater than zero, the quantity a|Az) is non-vanishing. 

To find further restrictions on the values of A, we select a suitably large, but 
otherwise arbitrary value of A, say ?/, and continually apply the lowering 
operator a to the eigenstate | iji), thereby forming a succession of eigenvectors 

a\iji), a 2 \rji), a 3 |?;z), ... 

with respective eigenvalues rj — rj — 2, rj — 3, ... We have already shown 
that if \iji) is an eigenfunction of N, then a\rji) is also an eigenfunction. By 
iteration, if a\rji) is an eigenfunction of N, then a 2 \rji) is an eigenfunction, and 
so forth, so that the members of the sequence are all eigenfunctions. Eventually 
this procedure gives an eigenfunction a k \rji) with eigenvalue (rj — k), A being a 
positive integer, such that 0 ^ (rj — k) < 1. The next step in the sequence 
would yield the eigenfunction a k+1 \rji) with eigenvalue X = (rj — k — 1) < 0, 
which is not allowed. Thus, the sequence must terminate by the condition 

a k+1 \iji) = a[a k \rji)] = 0 
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The only circumstance in which a operating on an eigenvector yields the 
value zero is when the eigenvector corresponds to the eigenvalue X = 0, as 
shown in equation (4.29). Since the eigenvalue of d k \r/i) is >/ — k and this 
eigenvalue equals zero, we have t] — k = 0 and r/ must be an integer. The 
minimum value of X = r] — k is, then, zero. 

Beginning with X = 0, we can apply the operator o' successively to |0 i) to 
form a series of eigenvectors 

a^Oi), o^ 2 |0/), fl^ 3 |0z), ... 

with respective eigenvalues 0, 1, 2, ... Thus, the eigenvalues of the operator N 
are the set of positive integers, so that X = 0, 1,2, ... Since the value r/ was 
chosen arbitrarily and was shown to be an integer, this sequence generates all 
the eigenfunctions of N. There are no eigenfunctions corresponding to non¬ 
integral values of X. Since X is now known to be an integer n, we replace X by n 
in the remainder of this discussion of the harmonic oscillator. 

The energy eigenvalues as related to X in equation (4.25) are now expressed 
in terms of n by 

E n = (n + i )fuo , n = 0, 1,2, ... (4.30) 

so that the energy is quantized in units of hat. The lowest value of the energy or 
zero-point energy is hco/2. Classically, the lowest energy for an oscillator is 
zero. 


Non-degeneracy of the energy levels 

To determine the degeneracy of the energy levels or, equivalently, of the 
eigenvalues of the number operator N, we must first obtain the eigenvectors 
|Of) for the ground state. These eigenvectors are determined by equation (4.29). 
When equation (4.18a) is substituted for a, equation (4.29) takes the form 


or 


dtp 


0 i 


00 i 


= -§d§ 


This differential equation may be integrated to give 
0o/(l) = ce 


= -^ 2 / 2 = e ^/ V * 11 


where the constant of integration c is determined by the requirement that the 
functions 0 m (£) be normalized and e ia is a phase factor. We have used the 
standard integral (A.5) to evaluate c. We observe that all the solutions for the 
ground-state eigenfunction are proportional to one another. Thus, there exists 
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only one independent solution and the ground state is non-degenerate. If we 
arbitrarily set a equal to zero so that 0 O (£) is real, then the ground-state 
eigenvector is 

|0) = (4.31) 

We next show that if the eigenvalue n of the number operator N is non¬ 
degenerate, then the eigenvalue n + 1 is also non-degenerate. We begin with 
the assumption that there is only one eigenvector with the property that 

N\n) = n\n) 

and consider the eigenvector \(n + 1 )z), which satisfies 
N\(n + l)z) = (n + 1)| (« + l)/> 

If we operate on \(n + l)z) with the lowering operator a, we obtain to within a 
multiplicative constant c the unique eigenfunction | n), 

a\(n + 1)/) = c\n) 

We next operate on this expression with the adjoint of a to give 

a^a\(n + l)z) = N\(n + l)z) = ( n + 1)| (n + l)z) = ca^|«) 
from which it follows that 

I (n+ 1)0 =- C —ct ] \n) 

W T 1 

Thus, all the eigenvectors \(n + 1)/) corresponding to the eigenvalue n + 1 are 
proportional to a'\n) and are, therefore, not independent since they are 
proportional to each other. We conclude then that if the eigenvalue n is non¬ 
degenerate, then the eigenvalue n + 1 is non-degenerate. 

Since we have shown that the ground state is non-degenerate, we see that the 
next higher eigenvalue n = 1 is also non-degenerate. But if the eigenvalue n = 1 
is non-degenerate, then the eigenvalue n = 2 is non-degenerate. By iteration, all 
of the eigenvalues n of N are non-degenerate. From equation (4.30) we observe 
that all the energy levels E n of the harmonic oscillator are non-degenerate. 


4.3 Eigenfunctions 
Lowering and raising operations 

From equations (4.27) and (4.28) and the conclusions that the eigenvalues of N 
are non-degenerate and are positive integers, we see that a\n) and a J \n) are 
eigenfunctions of N with eigenvalues n — 1 and n + 1, respectively. Accor¬ 
dingly, we may write 

a\n) = c n \n — 1) (4.32a) 



4.3 Eigenfunctions 


115 


and 

cfi\n) = c'„\n+ 1) (4.32b) 


where c„ and c'„ are proportionality constants, dependent on the value of n, and 
to be determined by the requirement that \n — 1), |n), and \n + 1) are normal¬ 
ized. To evaluate the numerical constants c n and c' n , we square both sides of 
equations (4.32a) and (4.32b) and integrate with respect to £ to obtain 


•OO 

|b0„| 2 d£ 

J— OO 





J— OO 


(4.33a) 


and 


*oo 

\a^(p n \ 2 d§ 

— OO 


0* + i0„+i d£ 


(4.33b) 


The integral on the left-hand side of equation (4.33a) may be evaluated as 
follows 


|fl0„| 2 d§ = (a0*)(a0„)d£ = (n\d^d\n) = (n\N\n) = n 

X) J —OO 

Similarly, the integral on the left-hand side of equation (4.33b) becomes 


a^tp ,,| 2 d§ = 


(p*)(a) <fi n ) d§ = {n\acf\n) = ( n\N + l|n) = n + 1 


tU\ - 


Since the eigenfunctions are normalized, we obtain 

|c„| 2 = n, \c' n \ 2 = n + 1 

Without loss of generality, we may let c n and c' n be real and positive, so that 
equations (4.32a) and (4.32b) become 

a\n) = \J~n\n — 1) (4.34a) 


ci^\n) = V n + 1| n + 1) (4.34b) 

If the normalized eigenvector \n) is known, these relations may be used to 
obtain the eigenvectors | n — 1) and |« + 1), both of which will be normalized. 


Excited-state eigenfunctions 

We are now ready to obtain the set of simultaneous eigenfunctions for the 
commuting operators N and //. The ground-state eigenfunction |0) has already 
been determined and is given by equation (4.31). The series of eigenfunctions 
11), 12), ... are obtained from equations (4.34b) and (4.18b), which give 

\n + 1) = [2 (n + I)]” 1 / 2 ^ - -0 | n) (4.35) 

Thus, the eigenvector 11) is obtained from |0) 
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| 1 ) = 2- 1 / 2 ^ l/V£72) = 21/2^-1/4^-172 

the eigenvector |2) from |1) 

12) = - A^(2 1 /2^-i/4| e -^/2 ) = 2- 1 /2 7r -i/4 (2 ^ _ 1)e -l 2 /2 

the eigenvector |3) from |2) 

|3> = 6_ ' /2 (?- ^)<2- 1/ V 1/ W - l)e- £I/2 ) 

= - 39e- £ ' /2 

and so forth, indefinitely. Each of the eigenfunctions obtained by this procedure 
is normalized. 

When equation (4.18a) is combined with (4.34a), we have 

|//-1>- (2«) (4-36) 

Just as equation (4.35) allows one to go ‘up the ladder’ to obtain \n + 1) from 
| n), equation (4.36) allows one to go ‘down the ladder’ to obtain \n — 1) from 
| n). This lowering procedure maintains the normalization of each of the 
eigenvectors. 

Another, but completely equivalent, way of determining the series of eigen¬ 
functions may be obtained by first noting that equation (4.34b) may be written 
for the series n = 0, 1, 2, ... as follows 

|i) = a t |o) 

|2) =2" 1 / 2 a t |l) = 2” 1 / 2 (a t ) 2 |0) 

|3) = 3~ 1/2 « + |2> = (3!) _1/2 (a t ) 3 |0) 


Obviously, the expression for | n) is 

\n) = («!)- 1 / 2 (at)"|0) 

Substitution of equation (4.18b) for o' and (4.31) for the ground-state eigen¬ 
vector |0) gives 

| n) = (2”«!) -1 / 2 jr -1 / 4 e“^( 2 

This equation may be somewhat simplified if we note that 


(4.37) 



4.3 Eigenfunctions 


117 




d£. 


I - ^ e^ 2 = h - « e* 2 ^ = ^/ 2 - - e^ 2 £ e^ : 


d£ 


= — e ^ 2 / 2 — e - ^ 2 




so that 


| -|)" e " { ’ /2 = ( - 1) "' s ’ /2 ^ e “ f 

Substitution of equation (4.38) into (4.37) gives 


d" e2 

-e s 

dq" 

(4.38) 

, d" 

'-e s 

dq" 

(4.39) 


which may be used to obtain the entire set of eigenfunctions of N and II. 


Eigenfunctions in terms of Hermitepolynomials 

It is customary to express the eigenfunctions for the stationary states of the 
harmonic oscillator in terms of the Hermite polynomials. The infinite set of 
Hermite polynomials H n (^) is defined in Appendix D, which also derives many 
of the properties of those polynomials. In particular, equation (D.3) relates the 
Hermite polynomial of order n to the nth-order derivative which appears in 
equation (4.39) 

Therefore, we may express the eigenvector | n) in terms of the Hermite 
polynomial I Iff) by the relation 

}n) = 4> n (g) = (2”n\)-^ 2 7C^ 4 H n (^f 2 (4.40) 

The eigenstates f n (x) are related to the functions 0„(§) by equation (4.16), 
so that we have 


lp n (x) 



1/4 


| 2 /2 


1 = 



1/2 


X 


(4.41) 


For reference, the Hermite polynomials for n = 0 to n = 10 are listed in Table 
4.1. When needed, higher-order Hermite polynomials are most easily obtained 
from the recurrence relation (D.5). If only a single Hermite polynomial is 
wanted and the neighboring polynomials are not available, then equation (D.4) 
may be used. 
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Table 4.1. Hermite polynomials 


n 


VnCQ 


0 1 

1 2 £ 

2 4£ 2 — 2 

3 8£ 3 - 12 § 

4 16£ 4 - 48£ 2 + 12 

5 32£ 5 — 160£ 3 + 120 £ 

6 64£ 6 - 480£ 4 + 720£ 2 - 120 

7 128£ 7 - 1344£ 5 + 3360£ 3 - 1680 £ 

8 256£ 8 - 3584| 6 + 13440§ 4 - 13440£ 2 + 1680 

9 512£ 9 - 92161; 7 + 48384§ 5 - 80640£ 3 + 302401 ; 

10 1024£ 10 - 230401; 8 + 161280£ 6 - 4032001; 4 + 302400£ 2 - 30240 


The functions 0„(£) in equation (4.40) are identical to those defined by 
equation (D.15) and, therefore, form a complete set as shown in equation 
(D.19). Substituting equation (4.16) into (D.19) and applying the relation 
(C.5b), we see that the functions >p n (x) in equation (4.41) form a complete set, 
so that 

OQ 

Y tpn(x)ip„(x') = d(x - x') (4.42) 

72=0 


Physical interpretation 

The first four eigenfunctions ip„(x) for n = 0, 1, 2, 3 are plotted in Figure 4.2 
and the corresponding functions [ip n (x)] 2 in Figure 4.3. These figures also 
show the outline of the potential energy V(x) from equation (4.5) and the four 
corresponding energy levels from equation (4.30). The function [''/>«(x)] 2 is the 
probability density as a function of x for the particle in the nth quantum state. 
The quantity [rp n (x)] 2 dx at any point x gives the probability for finding the 
particle between x and x + dx. 

We wish to compare the quantum probability distributions with those 
obtained from the classical treatment of the harmonic oscillator at the same 
energies. The classical probability density P(y ) as a function of the reduced 
distance y (— 1 y 1) is given by equation (4.10) and is shown in Figure 
4.1. When equations (4.8), (4.14), and (4.30) are combined, we see that the 
maximum displacement in terms of £ for a classical oscillator with energy 
(n + ^)hoj is \]2n + 1. For § < —\j2n + 1 and § > s/2 n + 1, the classical 
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Figure 4.2 Wave functions and energy levels for a particle in a harmonic potential well. 
The outline of the potential energy is indicated by shading. 


probability for finding the particle is equal to zero. These regions are shaded in 
Figures 4.2 and 4.3. 

Each of the quantum probability distributions differs from the corresponding 
classical distribution in one very significant respect. In the quantum solution 
there is a non-vanishing probability of finding the particle outside the classi¬ 
cally allowed region, i.e., in a region where the total energy is less than the 
potential energy. Since the Flermite polynomial //„(§) is of degree n, the wave 
function f n (x) has n nodes, a node being a point where a function touches or 
crosses the x-axis. The quantum probability density [ip n (x)] 2 is zero at a node. 
Within the classically allowed region, the wave function and the probability 
density oscillate with n nodes; outside that region the wave function and 
probability density rapidly approach zero with no nodes. 
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Figure 4.3 Probability densities and energy levels for a particle in a harmonic potential 
well. The outline of the potential energy is indicated by shading. 


While the classical particle is most likely to be found near its maximum 
displacement, the probability density for the quantum particle in the ground 
state is largest at the origin. However, as the value of n increases, the quantum 
probability distribution begins to look more and more like the classical 
probability distribution. In Figure 4.4 the function [V^oOO] 2 is plotted along 
with the classical result for an energy 30.5 hco. The average behavior of the 
rapidly oscillating quantum curve agrees well with the classical curve. This 
observation is an example of the Bohr correspondence principle, mentioned in 
Section 2.3. According to the correspondence principle, classical mechanics 
and quantum theory must give the same results in the limit of large quantum 
numbers. 
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Figure 4.4 The probability density \ipw(x)\ 2 for an oscillating particle in state n — 30. 
The dotted curve is the classical probability density for a particle with the same 
energy. 


4.4 Matrix elements 


In the application to an oscillator of some quantum-mechanical procedures, the 
matrix elements of x" and p" for a harmonic oscillator are needed. In this 
section we derive the matrix elements (n'\x\n), (n'\x 2 \n), (n'\p\n), and 
(n'\p 2 \n), and show how other matrix elements may be determined. 

The ladder operators a and eft defined in equation (4.18) may be solved for x 
and for p to give 


f h Y /2 

.(mhatV 12 t 

P = \~Y~) ( a ~ a ) 


(4.43a) 

(4.43b) 


From equations (4.34) and the orthonormality of the harmonic oscillator 
eigenfunctions |n), we find that the matrix elements of a and a 1 are 

(n'\a\n) = y/n{n'\n — 1) = \fnd n \ n -\ (4.44a) 

{n'\cft\n) = Vn + l(n'\n + 1) = V n + \d n ^ n+ \ (4.44b) 

The set of equations (4.43) and (4.44) may be used to evaluate the matrix 
elements of any integral power of x and p. 

To find the matrix element (n'\x\n), we apply equations (4.43a) and (4.44) to 
obtain 





122 


h 

2 met) 


Harmonic oscillator 


so that 


(n'\x\n) = 


1/2 


((n'\a^\n) + (n'\a\n)) 


( h \ 1/2 

V 2 Hid) / ^ \f n "t" - 1(5 n',n +1 T~ 1) 


(« + l|x|«) = 


(« + l)fr 

2 mco 


(4.45a) 


/ ,i i \ / nfl 

[n — Iran = W-- 

V 2moj 

(«'|x|«) =0 for n' ^ « + 1, n — 1 
If we replace « by « — 1 in equation (4.45a), we obtain 



From equation (4.45b) we see that 

(n — l|x|«) = («|x|« — 1) 
Likewise, we can show that 

(n + l|x|«) = («|x|« + 1) 


(4.45b) 

(4.45c) 


In general, then, we have 

(«'|x|«) = (n|x|n') 

To find the matrix element (n'\p\n), we use equations (4.43b) and (4.44) to 
give 


(n'\p\n) 



1/2 

(n'\cft — 


a\n) 


so that 



( „ + ,| > l„ ) = i V /L±p“ 

(n'\p\n) = 0 for n' ^ n + 1, n — 1 
We can easily show that 


(4.46a) 

(4.46b) 


(4.46c) 
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(n'\p\n) = -(n\p\n') 

The matrix element (n'\x 2 \n) is 

(n'\x 2 \n) = -( n'\(a t + a) 2 \n) = - (n'\(a^) 2 + a^a + act^ + a 2 \n) 

2 mto 2 mto 

From equation (4.34) we have 

(a*) 2 \n) = \Jn+\c^\n + 1) = \f(n + 1 ){n + 2)|n + 2) 

a^a\n) = \fna^\n — 1) = n\n) 

aa^\n) = V n + la\n + 1) = (« + 1)|«) 

a 2 \n) = \/na\n — 1 ) = \J n(n — 1 )|« — 2 ) 


(4.47) 


so that 


(n'|x 2 |«) = -|V( n+ 1)(« + 2)<5„-,„ + 2 + (2w + 1)(5„'„ 

2/noj 


+ yV" - 1)<5„',„_ 2 ] 


We conclude that 


h 


(n + 2|x 2 |n) = («|x 2 |« + 2) = -- \j(n + l)(n + 2) (4.48a) 

2 mm 




(«|x 2 |w) =- (n + i) 


mm 


ft 


(n — 2|x 2 |n) = («|x 2 |n — 2) = - \J n(n — 1) 

2 mm 

(«'|x 2 |n) = 0, ri ^ n + 2, n, n — 2 


(4.48b) 

(4.48c) 

(4.48d) 


The matrix element (n'\p 2 \n) is obtained from equations (4.43b) and (4.47) 


( mhco\ 


{ mftm\ 


( n'\p | n) = -\—^j( n ' |(« T - «) l«> = - J <«'I(« T ) 
— a a t + a 2 |n) 


mftm 


^ 2 — ct^a 



n + l)(n + 2)(5„' j „ +2 - (2n + T)(5, 


+ sjn(n - l)d„' „_2] 


so that 
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( > \ 

~Y~) V( n + 1)(« + 2 ) (4.49a) 

(n\p 2 \n) = mhoj(n + 4) (4.49b) 

(« - AP 2 \ n ) = (n\p 2 \n - 2) = - 1) (4.49c) 

(n'\p 2 \n) =0, n' / n + 2, n, n — 2 (4.49d) 

Following this same procedure using the operators (cP ± a)*, we can find the 
matrix elements of x k and of p k for any positive integral power k. In Chapters 
9 and 10, we need the matrix elements of x 3 and x 4 . The matrix elements 
(w'|x 3 |«) are as follows: 

( h \ 3 / 2 / _ 

(n + 3|x 3 |n) = (n|x 3 |n + 3) = --) \j (n + l)(n + 2 ){n + 3) (4.50a) 

\2 mco J 


(n + l|x 3 |n) = («|x 3 |« + 1) = 3 f ^ ^ (4.50b) 

\ 2 mco J 

( nh \ 3/2 

(n — l|x 3 |w) = (n|x 3 |« — 1) = 3( -- J (4.50c) 

/ h \ 3 / 2 ,_ 

(n — 3|x 3 |«) = («|x 3 |« — 3) = ( - j \Jn{n — 1)(« — 2) (4.50d) 

(n'|x 3 |w) = 0, n' ^ n ± 1, n ± 3 (4.50e) 

The matrix elements («'|x 4 |n) are as follows 


(n + 4|x 4 |«) = (n\x 4 \n + 4) = (^~~j \!( n + IX” + 2 )(n + 3)(n + 4) 

(4.51a) 

(n + 2|x 4 |«) = («|x 4 |n + 2) = -(-) (2n + 3)\/(n + l)(w + 2) (4.51b) 

2 \ / 

<n|l4| " > = i(^) (» 2 + ” + i) (4 ' 51c) 

(n — 2|x 4 |«) = (n|x 4 |« — 2) = -(-J (2 n — 1)^«(« — 1) (4.51 d) 

(« — 4|x 4 |«) = (n|x 4 jn — 4) = ^ ^ n ( n ~ 1)(« — 2)(« — 3) 

(«'|x 4 |«) = 0, n' ^ n, n ± 2, n ± 4 


(4.5 le) 
(4.5 If) 
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4.5 Heisenberg uncertainty relation 


Using the results of Section 4.4, we may easily verify for the harmonic 
oscillator the Heisenberg uncertainty relation as discussed in Section 3.11. 
Specifically, we wish to show for the harmonic oscillator that 

AxA p 5= \fi 


where 


(Ax) 2 = ((x - (x)) 2 ) 

(A pf = (C P ~ ip)f) 


The expectation values of x and of p for a harmonic oscillator in eigenstate 
| n) are just the matrix elements (n|x|«) and (n\p\n), respectively. These matrix 
elements are given in equations (4.45c) and (4.46c). We see that both vanish, 
so that (Ax) 2 reduces to the expectation value of x 2 or (n|x 2 |/i) and ( A p) 2 
reduces to the expectation value of p 2 or (n\p 2 \n). These matrix elements are 
given in equations (4.48b) and (4.49b). Therefore, we have 

a.v= (— y /2 ( „+ u 2 

\ma>J 


A p = (mha)) l ^ 2 (n 4-i) 1 / 2 

and the product AxA p is 

AxA /> = (« + \)h 

For the ground state (n = 0), we see that the product AxA p equals the 
minimum allowed value ft/ 2. This result is consistent with the form (equation 

(3.85) ) of the state function for minimum uncertainty. When the ground-state 
harmonic-oscillator values of (x), (p), and X are substituted into equation 

(3.85) , the ground-state eigenvector |0) in equation (4.31) is obtained. For 
excited states of the harmonic oscillator, the product AxA p is greater than the 
minimum allowed value. 


4.6 Three-dimensional harmonic oscillator 

The harmonic oscillator may be generalized to three dimensions, in which case 
the particle is displaced from the origin in a general direction in cartesian 
space. The force constant is not necessarily the same in each of the three 
dimensions, so that the potential energy is 

V = \k x x 2 + \k y y 2 +1 k z z 2 = ^wj((u 2 x“ -F co 2 ,y~ + co 2 z 2 ) 
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where k x , k v , k z are the respective force constants and co x , a> y , co z are the 
respective classical angular frequencies of vibration. 

The Schrodinger equation for this three-dimensional harmonic oscillator is 


h 2 / d 2 ip d 2 ip d 2 ip 
2 m l dx 2 dy 2 dz 2 


+ \m(a) 2 x x 2 + a> 2 y 2 + a) 2 ,z 2 )ip 


Eip 


where ip(x, y, z) is the wave function. To solve this partial differential equation 
of three variables, we separate variables by making the substitution 

>p(x, y, z) = X(x) Y(y)Z(z) (4.52) 


where X(x) is a function only of the variable x, Y(v) only of y, and Z(z) only 
of z. After division by —ip(x, y, z), the Schrodinger equation takes the form 


f h 2 d 2 X 
\2 mX dx 2 


i i i 

— iyma> x x 


+ 


f h 2 d 2 Y 

\2mY d y 2 


1 2 2 
{mto y y 


+ 


f h 2 d 2 Z 
\2 mZ dz 2 


— jmto 



= E 


The first term on the left-hand side is a function only of the variable x and 
remains constant when y and z change but x does not. Similarly, the second 
term is a function only of y and does not change in value when x and z change 
but y does not. The third term depends only on z and keeps a constant value 
when only x and y change. However, the sum of these three terms is always 
equal to the constant energy E for all choices of x, y, z. Thus, each of the three 
independent terms must be equal to a constant 


h 2 d 2 X 
2 mX dx 2 

h 2 d 2 7 
2 m Y dy 2 


I 2 2 t~» 

±ma) x x — E x 
- \ma) 2 y y 2 = E y 


fi 2 d 2 Z 


.2 _2 


2 mZ dz 2 

where the three separation constants E x , E v , E z satisfy the relation 

E x + Ey + E z = E 


(4.53) 


The differential equation for X(x) is exactly of the form given by (4.13) for a 
one-dimensional harmonic oscillator. Thus, the eigenvalues E x are given by 
equation (4.30) 

En. — (tl x + s)h<l) x , 


and the eigenfunctions are given by (4.41) 


n x = 0, 1, 2, ... 
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x n ,(x) = (2 7 


Similarly, the eigenvalues for the differential equations for Y(y) and Z(z) are, 
respectively 

En r — 1 n y -f- 2 yhoiy, tiy 0, 1,2, ... 

E„ z = (n z + \)ha) z , n z = 0, 1,2,... 

and the corresponding eigenfunctions are 


F ) , v (y) = (2"' Wv !)- 1 / 2 ^^^ ' //„,(>/)e 


Z„ : (z) = (2"-/i r !r l/2 7 H n XQ e 


The energy levels for the three-dimensional harmonic oscillator are, then, 
given by the sum (equation (4.53)) 

En x ,n y ,n z = (% + + (« v + y + (w z + j)/hu z (4.54) 

The total wave functions are given by equation (4.52) 

Vn x ,n y ,nX x ’ T> z ) = (2' ,J:+ ' ! ^ + ' Iz «,!n l ,!« z !) _1/2 f — j (m T m v a> z ) 1/4 


X H n X^H n Xv)H n X^ 2W+ ^ )t2 (4.55) 

An isotropic oscillator is one for which the restoring force is independent of 
the direction of the displacement and depends only on its magnitude. For such 
an oscillator, the directional force constants are equal to one another 

k x = ky k z — k 

and, as a result, the angular frequencies are all the same 

(O x = U) v = (O z = (O 

In this case, the total energies are 
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En x ,n y ,n z = («x + » y + U : + §)&<« = (« + \)hoj (4.56) 

where n is called the total quantum number. All the energy levels for the 
isotropic three-dimensional harmonic oscillator, except for the lowest level, are 
degenerate. The degeneracy of the energy level E n is (n + 1 )(n + 2)/2. 


Problems 

4.1 Consider a classical particle of mass m in a parabolic potential well. At time t the 
displacement x of the particle from the origin is given by 

x — a sin((wt + b) 

where a is a constant and co is the angular frequency of the vibration. From this 
expression find the kinetic and potential energies as functions of time and show 
that the total energy remains constant throughout the motion. 

4.2 Evaluate the constant c in equation (4.10). (To evaluate the integral, let y — 
cos 6.) 

4.3 Show that a and a in equations (4.18) are not hermitian and that a' is the adjoint 
of a. 

4.4 The operator N = a) a is hermitian. Is the operator a a' hermitian? 

4.5 Evaluate the commutators [H, a] and [H, a'f 

4.6 Calculate the expectation value of x 6 for the harmonic oscillator in the n — 1 
state. 

4.7 Consider a particle of mass m in a parabolic potential well. Calculate the 
probability of finding the particle in the classically allowed region when the 
particle is in its ground state. 

4.8 Consider a particle of mass m in a one-dimensional potential well such that 

V(x) — \mm 2 x 2 , x 3= 0 
= oo, x < 0 

What are the eigenfunctions and eigenvalues? 

4.9 What is the probability density as a function of the momentum p of an oscillating 
particle in its ground state in a parabolic potential well? (First find the 
momentum-space wave function.) 

4.10 Show that the wave functions A„(y) in momentum space corresponding to 
<!>„{'£,) in equation (4.40) for a linear harmonic oscillator are 

poo 

A n (y) = (2jry l/2 0„(£)e~ iyi; d£ 

J — oo 

= i- n (2 n n\jr l/2 )- l / 2 e -r 2 / 2 H„(y) 

where £ = (. ma)/1i) l l 2 x and y = {mfw))~ 1 ' 2 p. (Use the generating function (D.l) 
to evaluate the Fourier integral.) 
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4.11 Using only equation (4.43b) and the fact that a' is the adjoint of a, prove that 

{n’\p\n) = ~{n\p\n') 

4.12 Derive the relations (4.50) for the matrix elements (n'\x''\n). 

4.13 Derive the relations (4.51) for the matrix elements (n’\x 4 \n). 

4.14 Derive the result that the degeneracy of the energy level E„ for an isotropic 
three-dimensional harmonic oscillator is (n + 1)(« + 2)/2. 
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Angular momentum 


Angular momentum plays an important role in both classical and quantum 
mechanics. In isolated classical systems the total angular momentum is a 
constant of motion. In quantum systems the angular momentum is important in 
studies of atomic, molecular, and nuclear structure and spectra and in studies 
of spin in elementary particles and in magnetism. 


5.1 Orbital angular momentum 


We first consider a particle of mass m moving according to the laws of classical 
mechanics. The angular momentum L of the particle with respect to the origin 
of the coordinate system is defined by the relation 

L = r X p (5.1) 


where r is the position vector given by equation (2.60) and p is the linear 
momentum given by equation (2.61). When expressed as a determinant, the 
angular momentum L is 


i J 

x y 
Px Py 


The components L x , L v , L z of the vector L are 


k 

z 

Pz 


L x = yp z - ZPy 

Ly = zp x - xp z (5.2) 


L z = xpy - ypx 

The square of the magnitude of the vector L is given in terms of these 
components by 

L 2 = L L = L 2 x + L 2 y + L 2 z (5.3) 
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If a force F acts on the particle, then the torque T on the particle is defined as 


T 


r X F = r X 


dp 

dr 


(5.4) 


where Newton’s second law that the force equals the rate of change of linear 
momentum, F = dp/dr, has been introduced. If we take the time derivative of 
equation (5.1), we obtain 


dL 

dr 



+ 




(5.5) 


since 


dr 

dr 


X P 


dr dr 

= — X m — =0 
dr dr 


Combining equations (5.4) and (5.5), we find that 


T 


dL 

dr 


(5.6) 


If there is no force acting on the particle, the torque is zero. Consequently, the 
rate of change of the angular momentum is zero and the angular momentum is 
conserved. 

The quantum-mechanical operators for the components of the orbital angular 
momentum are obtained by replacing p x , p y , p z in the classical expressions 
(5.2) by their corresponding quantum operators, 


h( d 8\ 

L x = yp z -zp y = j \^y— - z—J 


(5.7a) 


L 


y = ZPx ~ X Pz 


hfd d\ 

T \ z &c- x fc) 


(5.7b) 


L z 


h 

XPy - VPx = y 



(5.7c) 


Since y commutes with p z and z commutes with p y , there is no ambiguity 
regarding the order of y and p z and of z and p y in constructing L x . Similar 
remarks apply to L y and L z . The quantum-mechanical operator for L is 

L = i L x + j L y + k L z (5.8) 

and for Lr is 


L 2 = l-L = L 2 x + l\ + l\ (5.9) 

The operators L x , L y , L z can easily be shown to be hermitian with respect to a 
set of functions of x, y, z that vanish at ±oo. As a consequence, L and L 2 are 
also hermitian. 
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Commutation relations 

The commutator [L x , L v \ may be evaluated as follows 
[L x , Ly] = [yp z - zp y , zp x - xp z ] 

= [ypz, Zpx] + [zpy, Xp z ] - [ypz, Xp z ] - [zpy, Zpx] 

The last two terms vanish because yp z commutes with xp z and because zp y 
commutes with zp x . If we expand the remaining terms, we obtain 

[L x , Ly] = ypxpzZ - yp x zp z + xpyZp z - xpyp z z = (xpy - yp x )[z, p z ] 

Introducing equations (3.44) and (5.7c), we have 

[L x , Ly] = ihL z (5.10a) 

By a cyclic permutation of x, y, and z in equation (5.10a), we obtain the 
commutation relations for the other two pairs of operators 

[Ly, L z ] = iLL x (5.10b) 

[L z , l x ] = xhly (5.10c) 

Equations (5.10) may be written in an equivalent form as 

LXL = ifi.L (5.11) 

which may be demonstrated by expansion of the left-hand side. 


5.2 Generalized angular momentum 

In quantum mechanics we need to consider not only orbital angular momen¬ 
tum, but spin angular momentum as well. Whereas orbital angular momentum 
is expressed in terms of the x, y, z coordinates and their conjugate angular 
momenta, spin angular momentum is intrinsic to the particle and is not 
expressible in terms of a coordinate system. However, in quantum mechanics 
both types of angular momenta have common mathematical properties that are 
not dependent on a coordinate representation. For this reason we introduce 
generalized angular momentum and develop its mathematical properties 
according to the procedures of quantum theory. 

Based on an analogy with orbital angular momentum, we define a general¬ 
ized angular-momentum operator J with components J x , J y , J z 

J = i J x + j-7 y + k J z 

The operator J is any hermitian operator which obeys the relation 

J X J = if? j 


or equivalently 


(5.12) 
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[J x ,J y \ = ihJ z (5.13a) 

\j y ,J z ] = ihJ x (5.13b) 

[J z , J x \ = xhjy (5.13c) 

The square of the angular-momentum operator is defined by 

J 2 = J.J = /2 + J2 + J2 (5. 14 ) 


and is hermitian since J x , J and J z are hermitian. The operator J 2 commutes 
with each of the three operators J x , J v , We first evaluate the commutator 
[J 2 , Jz] 

[J 2 , Jz\ = [j\, Jz] + [J 2 y, Jz] + \j \, J -] 

= J X [J X , Jz] + [J X) Jz ]J X -f J y\Jy, Jz] T \Jy>> Jz\Jy 

= — i hJ x J y — ihJ y J x + i hJ y J x + i hJ x J y 

= 0 (5.15a) 

where the fact that J z commutes with itself and equations (3.4b) and (5.13) 
have been used. By similar expansions, we may also show that 

[J 2 , J x ] = 0 (5.15b) 

[J 2 ,J y ] = 0 (5.15c) 

Since the operator J 2 commutes with each of the components J x , J y , J z of 
j, but the three components do not commute with each other, we can obtain 
simultaneous eigenfunctions of J 2 and one, but only one, of the three compo¬ 
nents of J. Following the usual convention, we arbitrarily select J z and seek the 
simultaneous eigenfunctions of J 2 and J z . Since angular momentum has the 
same dimensions as h, we represent the eigenvalues of J 2 by Xfi 2 and the 
eigenvalues of J z by mh, where X and m are dimensionless and are real 
because J 2 and J z are hermitian. If the corresponding orthonormal eigenfunc¬ 
tions are denoted in Dirac notation by | Xm), then we have 

J 2 \Xm) = Xh 2 \Xm) (5.16a) 

j z \Xm) = mh 2 \Xm) (5.16b) 

We implicitly assume that these eigenfunctions are uniquely determined by 
only the two parameters X and m. 

The expectation values of J 2 and J 2 are, according to (3.46), and (5.16) 

(J 2 ) = (Xm\J 2 \Xm) = Xfi 2 


(. J 2 ) = (Xm\J 2 \Xm) = m 2 h 2 
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since the eigenfunctions | km) are normalized. Using equation (5.14) we may 
also write 

(J 2 ) = (.J \> + (J 2 y ) + (Jl) 

Since J x and J y are hermitian, the expectation values of J\ and J\ are real 
and positive, so that 

(J 2 ) > (Jl) 

from which it follows that 

0 (5.17) 


Ladder operators 

We have already introduced the use of ladder operators in Chapter 4 to find the 
eigenvalues for the harmonic oscillator. We employ the same technique here to 
obtain the eigenvalues of J 2 and J z . The requisite ladder operators J+ and J_ 
are defined by the relations 

J + = Jx “f tJy (5.18a) 

J-=J x -iJ y (5.18b) 

Neither J + nor ./_ is hermitian. Application of equation (3.33) shows that they 
are adjoints of each other. Using the definitions (5.18) and (5.14) and the 
commutation relations (5.13) and (5.15), we can readily prove the following 


relationships 

[J z ,J+\ = hJ + (5.19a) 

[J z ,J-\ = -hJ^ (5.19b) 

[J 2 ,J+] = 0 (5.19c) 

[J 2 , J_] = 0 (5.19d) 

[J+, J-] = 2hJ z (5.19e) 

J+J- = J 2 -J 2 z + hJ z (5.19f) 

jJ+ = j 2 -J 2 z -hj z (5.19g) 


If we let the operator J 2 act on the function J + \Xm) and observe that, 
according to equation (5.19c), J 2 and ./+ commute, we obtain 

J 2 J + \Xm) = J + J 2 \Xm) = Xfi 2 J+\Xm) 

where (5.16a) was also used. We note that J + \Xm) is an eigenfunction of J 2 
with eigenvalue Xfi 2 . Thus, the operator J + has no effect on the eigenvalues of 



135 


5.2 Generalized angular momentum 

J 2 because J 2 and J + commute. However, if the operator J z acts on the 
function J + \Xm), we have 

J z J+\Xm) = J + J z \Xm) + hJ + \Xm) = mhJ + \Xm) + liJ + \Xm) 

= (m + \)hJ + \Xm) (5.20) 

where equations (5.19a) and (5.16b) were used. Thus, the function J + \Xm) is 
an eigenfunction of J z with eigenvalue ( m + 1 )h. Writing equation (5.16b) as 

J Z \X, m + 1) = (m + \)h\X, m + 1) 
we see from equation (5.20) that J + \Xm) is proportional to \X, m + 1) 

j+\Xm) = c+\X, m + 1) (5.21) 

where c+ is the proportionality constant. The operator J+ is, therefore, a 
raising operator, which alters the eigenfunction | Xm) for the eigenvalue mh to 
the eigenfunction for (m + 1 )h. 

The proportionality constant c+ in equation (5.21) may be evaluated by 
squaring both sides of equation (5.21) to give 

(Xm\J-J + \Xm) = |c + | 2 (2, m + \\X, m + 1) 
since the bra (Xm\J is the adjoint of the ket J + \Xm). Using equations (5.16) 
and (5.19g) and the normality of the eigenfunctions, we have 

|c+| 2 = (Xm\J 2 — JZ — hJ z \Xm) = (X — m 2 — m)h 2 
and equation (5.21) becomes 

J + \Xm) = \JX — m(m + 1) h\X, m + 1) (5.22) 

In equation (5.22) we have arbitrarily taken c + to be real and positive. 

We next let the operators J 2 and J z act on the function./_ | Xm) to give 

j 2 J_\Xm) = J -J 2 \Xm) = Xh 2 j-\Xm) 

J z J-\Xm) = J-J : \Xm) — hj-\Xm) = (m — X)hj_\Xm) 

where we have used equations (5.16), (5.19b), and (5.19d). The function 
J-\Xm) is a simultaneous eigenfunction of J 2 and J z with eigenvalues Xfi 2 and 
( m — 1 )h, respectively. Accordingly, the function J_\Xm) is proportional to 
\X, m — 1) 

j-\Xm) = C-\X, m — 1) (5.23) 

where c_ is the proportionality constant. The operator J_ changes the eigen¬ 
function | Xm) to the eigenfunction \X, m — \) for a lower value of the eigen¬ 
value of J z and is, therefore, a lowering operator. 

To evaluate the proportionality constant c_ in equation (5.23), we square 
both sides of (5.23) and note that the bra (Xm\J + is the adjoint of the ket 
J-\Xm), giving 
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\c-\ 2 = (Xm\ J + J-\Xm) = (Xm\J 2 — J 2 + hJ z \Xm) = {X — m 2 + m)h 2 
where equation (5.19f) was also used. Equation (5.23) then becomes 

J-\Xm) = \JX — m(m — 1) h\X, m — 1) (5-24) 

where we have taken c_ to be real and positive. This choice is consistent with 
the selection above of c + as real and positive. 


Determination of the eigenvalues 

We now apply the raising and lowering operators to find the eigenvalues of J 2 
and J z . Equation (5.17) tells us that for a given value of X, the parameter m has 
a maximum and a minimum value, the maximum value being positive and the 
minimum value being negative. For the special case in which X equals zero, the 
parameter m must, of course, be zero as well. 

We select arbitrary values for X, say §, and for m, say where 0 r/ 2 £ 

so that (5.17) is satisfied. Application of the raising operator J + to the 
corresponding ket | %rj) gives the ket |£, t] + 1). Successive applications of J + 
give |£, r] + 2), |q, // + 3), etc. After k such applications, we obtain the ket 
|!/), where j = t] + k and j 2 =£ q. The value of j is such that an additional 
application of J + produces the ket | q, j + 1) with (J + 1 ) 2 > 5 (that is to say, it 
produces a ket \Xm) with m 2 > X), which is not possible. Accordingly, the 
sequence must terminate by the condition J+\^j) = 0 . From equation (5.22), 
this condition is given by 

J +\£/) = yj£~j(j+ 1 ) h \£,j+ 1 ) = 0 

which is valid only if the coefficient of 1 j + 1 ) vanishes, so that we have 

% = j(j+ !)• 

We now apply the lowering operator ./_ to the ket |q/) successively to 
construct the series of kets |£, j— 1), ||, j — 2), etc. After a total of n 
applications of J_, we obtain the ket |£/), where j' = j — n is the minimum 
value of m allowed by equation (5.17). Therefore, this lowering sequence must 
terminate by the condition 

j- m = vz-fw- 1 ) m r - 1 > = 0 

where equation (5.24) has been introduced. This condition is valid only if the 
coefficient of |£, j' — 1 ) vanishes, giving § = /'(/' — 1 ). 

The parameter § has two conditions imposed upon it 

! = ./(/+ 1 ) 

Z = jV- 1 ) 


giving the relation 
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.7(7+1) =/(/-!) 
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The solution to this quadratic equation gives / = —j. The other solution, 
/' = j + 1, is not physically meaningful because / must be less than j. We 
have shown, therefore, that the parameter m ranges from — j to j 

-j m s= j 

If we combine the conclusion that / = —j with the relation j' = j — n, we see 
that j = m/2, where n = 0, 1,2, ... Thus, the allowed values of j are the 
integers 0, 1 , 2, ... (if n is even) and the half-integers /, f , ... (if n is odd) 
and the allowed values of m are —j, —j + 1 ,..., j — 1, j. 

We began this analysis with an arbitrary value for X, namely X = §, and an 
arbitrary value for m, namely m = >/. We showed that, in order to satisfy 
requirement (5.17), the parameter § must satisfy § = j(j + 1), where j is 
restricted to integral or half-integral values. Since the value £ was chosen 
arbitrarily, we conclude that the only allowed values for X are 

A = 7(7+1) (5.25) 

The parameter t] is related to j by j = rj + k, where k is the number of 
successive applications of J + until \g>f) is transformed into |£/). Since k must 
be a positive integer, the parameter t] must be restricted to integral or half¬ 
integral values. However, the value t] was chosen arbitrarily, leading to the 
conclusion that the only allowed values of m are m = —j, — j + 1, ..., j — 1, 
j. Thus, we have found all of the allowed values for X and for m and, therefore, 
all of the eigenvalues of J 2 and J z . 

In view of equation (5.25), we now denote the eigenkets \Xm) by \jm). 
Equations (5.16) may now be written as 

J 2 \jm) = JV + l)ft 2 | jm), j = 0, 1, 2, ... (5.26a) 

J z \jm) = mh\jm), m = -j, -j + 1, ..., j - 1, j (5.26b) 

Each eigenvalue of J 2 is (2/+ l)-fold degenerate, because there are (2J + 1) 
values of m for a given value of j. Equations (5.22) and (5.24) become 

J+\jm) = \Jj{j + 1) - m(m + 1) h\j, m+ 1) 

= \/{j — m)(j + m + 1) h\j, m + 1) (5.27a) 


J-\jm) = Vt(7 + 1) - m(m ~ 1) h\j, m - 1) 

= VU+ "0(7- m + 1) h\j, m - 1) 


(5.27b) 
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5.3 Application to orbital angular momentum 

We now apply the results of the quantum-mechanical treatment of generalized 
angular momentum to the case of orbital angular momentum. The orbital 
angular momentum operator L, defined in Section 5.1, is identified with the 
operator J of Section 5.2. Likewise, the operators L 2 , L x , L y , and L z are 
identified with J 2 , J x , J y , and J~, respectively. The parameter j of Section 5.2 
is denoted by / when applied to orbital angular momentum. The simultaneous 
eigenfunctions of Lr and L- are denoted by | Im), so that we have 

L 2 \lm) = l(l+\)h 2 \lm) (5.28a) 

L z \lm) = mh\lm ), m = —l, — l + 1, ..., / — 1, l (5.28b) 

Our next objective is to find the analytical forms for these simultaneous 
eigenfunctions. For that purpose, it is more convenient to express the operators 
L x , L y , L-, and L 2 in spherical polar coordinates r, 6 , <p rather than in cartesian 
coordinates x, y, z. The relationships between r, 6 , tp and x, y, z are shown in 
Figure 5.1. The transformation equations are 


x = r sin 0 cos cp 

(5.29a) 

v = r sin 0 sin tp 

(5.29b) 

z = r cos 0 

(5.29c) 

r = (x 2 + y 2 + z 2 ) 1//2 

(5.29d) 

6 = cos _1 (z/(x 2 + y 2 + z 2 ) 1 / 2 ) 

(5.29e) 

c p = tan -1 (y/x) 

(5.29f) 


Z 



Figure 5.1 Spherical polar coordinate system. 
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These coordinates are defined over the following intervals 

—03 x, y, z =£ oc, 0 r oc, 0 *£ 6 s£ jt, 0 s£ tp s£ 2jt 

The volume element dr = dxdvdr becomes dr = r 2 sin 0 dr d 0 dtp in spheri¬ 
cal polar coordinates. 

To transform the partial derivatives d/dx, d/dy, d/dz, which appear in the 
operators L x , L v , L, of equations (5.7), we use the expressions 

d (dr\ d (dd\ d (dtp\ d 
dx \ dx ) yz dr \ dx J yz dO ' \ dx ) yz dtp 


. n d cos 0 cos tp d sinro d 

= sin 6 cos (p-zr~\ -tt^- 

dr r dO r sintf dtp 

d (dr\ d f dO\ d f dtp\ d 

dy - \dy) x , z dr + \dy) xz dO + \dy) xz dtp 

. n . d cos 0 sin tp d cos tp d 

= smOsmtp — +-ttx + — 

dr r dd r sin d dtp 

d (dr\ d (dO\ d (dtp\ d 


77 (5.30a) 


dz \dz) xy dr \dzj x<y d6 \dz) x<y dtp 
d sin 0 d 

= ms 0 d?-—ae 

Substitution of these three expressions into equations (5.7) gives 
r hf . d Q d\ 

L ’ = i {- sm «’ae~ co ‘ l>coslp d v ) 


L y = - cos tp — — cot 6 sin tp 


, _h d 

i dtp 


(5.30b) 


(5.30c) 


(5.31a) 


(5.31b) 


(5.31c) 


By squaring each of the operators L x , L y , L : and adding, we find that L 1 is 
given in spherical polar coordinates by 


dO) sin 2 6 dtp 


Since the variable r does not appear in any of these operators, their eigen¬ 
functions are independent of r and are functions only of the variables 6 and tp. 
The simultaneous eigenfunctions | Im) of L 2 and L- will now be denoted by the 
function T/ m ( 0 , tp) so as to acknowledge explicitly their dependence on the 
angles 0 and tp. 
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The eigenvalue equation for L- is 

fi c) 

L Z YUG, (p) = ~— Y lm (6, <p) = mhY lm (9, <p) (5.33) 

1 dtp 

where equations (5.28b) and (5.31c) have been combined. Equation (5.33) may 
be written in the form 

d Yi m (6, (p) 

--—- = i m acp (9 held constant) 

Yim(0, cp) 

the solution of which is 

YiJO, cp) = GU6)e imq> (5.34) 

where 0 /„,( 0 ) is the ‘constant of integration’ and is a function only of the 
variable 9. Thus, we have shown that Y/ m (9, <p) is the product of two functions, 
one a function only of 9, the other a function only of <p 

Yi m (9, cp) = Q lm (9)<Y m Up) (5.35) 

We have also shown that the function O m ((p) involves only the parameter m 
and not the parameter /. 

The function O m (q>) must be single-valued and continuous at all points in 
space in order for Y/ m (9, cp) to be an eigenfunction of L 2 and L-. If O m (tp) and 
hence T/ m ( 0 , <p) are not single-valued and continuous at some point <po, then 
the derivative of Yi m (9 , <p) with respect to cp would produce a delta function at 
the point tp 0 and equation (5.33) would not be satisfied. Accordingly, we 
require that 

$ m((p) = O m(<P + 2jr) 
or 

gimip _ gim(i?)+2jr) 

so that 

g2imjr _ j 

This equation is valid only if m is an integer, positive or negative 

m = 0 , ±1, ± 2 , ... 

We showed in Section 5.2 that the parameter m for generalized angular 
momentum can equal either an integer or a half-integer. However, in the case 
of orbital angular momentum, the parameter m can only be an integer; the half¬ 
integer values for m are not allowed. Since the permitted values of m are —l, 
— I + 1 , ..., l — 1 , /, the parameter / can have only integer values in the case 
of orbital angular momentum; half-integer values for / are also not allowed. 


Ladder operators 

The ladder operators for orbital angular momentum are 
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L+ — L x + i L\ 


L— — L x tLy 


(5.36) 


and are identified with the ladder operators J + and J_ of Section 5.2. Substi¬ 
tution of (5.31a) and (5.31b) into (5.36) yields 


L + = h^[ — + icote— 


i - =fcr “'(-^ +lco,e l; 


(5.37a) 


(5.37b) 


where equation (A.31) has been used. When applied to orbital angular 
momentum, equations (5.27) take the form 

L + Y lm (6, tp) = \/(7 - m)(l + m + 1) hYp m+{ (6, (p) (5.38a) 

L- Yj m {6, tp) = \/(l+ m)(l - m+ 1) hY^ m _\{d, tp) (5.38b) 

For the case where m is equal to its minimum value, m = —l, equation 
(5.38b) becomes 

L- Yi _[(6, tp) = 0 


or 

(-^ + ico , e ^ K ' (e - <?)=0 

when equation (5.37b) is introduced. Substitution of K/,_/(0, <p) from equation 
(5.34) gives 

(- §g + i c °< » Vi-im- 11 ' = (- ^ + / cot ej = o 

Dividing by e~ lUp , we obtain the differential equation 

/ COS f) I 

d In 0/_/((?) = IcotOdd = - — d6 = --dsin0 = /din sin 0 

sin0 sin0 

which has the solution 

®f_i(0) = Ai sin 1 6 (5.39) 

where A / is the constant of integration. 


Normalization ofYi_i(6 , <p) 

Following the usual custom, we require that the eigenfunctions Y/ m (0, tp) be 
normalized, so that 
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Y*J6, (p)Y lm {6, <p) sin 6 dO dtp 


= ©l n (0)@ lm (6) sin 6 d 0 O l((p)0 m (q>) dtp = 1 

Jo Jo 

where the 6- and (^-dependent parts of the volume element dr are included in 
the integration. For convenience, we require that each of the two factors 0 i m (0 ) 
and O „,((?) be normalized. Writing O m (q>) as 

<5>m((p) = Ae im<p 


we find that 


t tin 

(Ae im<p )*(Ae im<l, )d(p = \A\ 2 dtp 

Jo 

A = e m /V2jr 


giving 


®m((p) = 


(5.40) 


where we have arbitrarily set a equal to zero in the phase factor e ia associated 
with the normalization constant. 

The function 0/_/(0) is given by equation (5.39) and the value of the 
constant of integration A / is determined by the normalization condition 

*7T pJT 

[0/,_/(0)]*©/,_/(0) sintfdtf = \A ,\ 2 sin 2/+1 6»d@ = 1 (5.41) 

. o Jo 

We need to evaluate the integral // 


//= sin + 6>d$ = — (1 —/r) d/i = (T — pc) dpi 

Jo Ji J-i 

where we have defined the variable ft by the relation 

f-i = cos 0 (5.42) 

so that 

1 — fi 2 = sin 2 0, d/i = —sin0d0 
The integral // may be transformed as follows 

// = [ (T - dfi - [ (1 - ( m 2 ) / “V 2 dfi = 11— i + [ ^ d(l - fi 2 ) 


= //_, - 




where we have integrated by parts and noted that the integrated term vanishes. 
Solving for //, we obtain a recurrence relation for the integral 
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21 

h = ^, h-\ 


21 + 1 


143 

(5-43) 


Since /q is given by 


/n = 


d/r = 2 


-l 


we can obtain // by repeated application of equation (5.43) starting with 7 0 
(2/)(2/ - 2)(2 1 - 4) • • • 2 2 2/+1 (/!) 2 

1 ~ (21 + 1)(2/ - 1)(2/ - 3) • • • 3 7 ° ~~ (21 + 1)! 
where we have noted that 

(2/)(2/ - 2) ••• 2 = 2 'll 
(21 + 1)(2/ — l)(2/-3) ••• 3 

(2/ + 1)(20(2/ - 1 )(2/ - 2)(2/ - 3) • • • 3 X 2 X 1 (2/ + 1)! 


(2 0 ( 2 / - 2 ) • • • 2 

Substituting this result into equation (5.41), we find that 


20 ! 


M/I = 


20 ! 


1 (21 + 1)! 


It is customary to let a equal zero in the phase factor e ia for 0/_/(0), so that 

(5.44) 




1 /(27+1)! . , 


sin 0 


Combining equations (5.35), (5.40) and (5.44), we obtain the normalized 
eigenfunction 


Y L -t(d, cp) = ^ 


1 • A 2/+ 1 ) ! s i n / 6» e - i/,p (5.45) 


4 Jt 


Spherical harmonics 

The functions Yj m (0, cp) are known as spherical harmonics and may be 
obtained from 7/ _/(0, <p) by repeated application of the raising operator L + 
according to (5.38a). By this procedure, the spherical harmonics T/_/ + i(0, cp), 
Yi- 1 + 2 ( 6 , tp), ..., Yi-\( 6 , tp), Yio( 6 ,(p), 7/i(0, <p), ..., Yu( 6 , (p ) may be 
determined. Since the starting function 7/,_/(0, <p) is normalized, each of the 
spherical harmonics generated from equation (5.38a) will also be normalized. 

We may readily derive a general expression for the spherical harmonic 
Y/ m (0, tp) which results from the repeated application of L+ to 7/,_/(0, <p). We 
begin with equation (5.38a) with m set equal to — / 


7/-/+1 


(5.46) 
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For m equal to — / + 1, equation (5.38a) gives 

1 r 

Yl-l+2 = ^+Yl,-l+l = 


1 


■ L\Y 


y/2(21- 1 ) h ’ a / 2 ( 2 /)( 2 / — 1 ) fi 2 

where equation (5.46) has been introduced in the last term. If we continue in 
the same pattern, we find 

1 ? 1 

Yl-l+3 = L + Yi-1 +2 = ■ 


■v/3(2/-2) ft 


a/2 • 3(20(2/ - 1)(2/ - 2) ft 3 




(21 - k)\ 1 *, 

where A* is the number of steps in this sequence. We now set k = l + m in the 
last expression to obtain 


kIni — 


I (l-m)\ 1 > /+w 

(/+ m)!(2/)!^ /+m + 7 ’ 1 


(5.47) 


If the number of steps k is less than the value of /, then the integer m is 
negative; if k equals /, then m is zero; if k is greater than /, then m is positive; 
and finally if k equals 21, then m equals its largest value of /. 

The next step in this derivation is the evaluation of Z/ +m Yi-i using equation 
(5.37a). If the operator L + in (5.37a) acts on T/ _/(0, <p) as given in (5.45), we 
have 

L+Yj-i = c/he 1(p ^^ + i cot d-^j sin 'de~ llcp 


= c/he 1(1 1)<P Q)) + /cot0^ sin^ 
= (2 + /cote) ^2 


= —c/Aie _i(/_1)<p 


1 


sin 2/ 0 — / sin' ^cos 0 +/sin' '6/cos(Z 


J -1 


shfi# d 6 
1 d 


/-i 


sin /_1 0d(cos 6) 
where for brevity we have defined c/ as 

1 


sin 2/ 0 


ci = 


2 l l\ 


( 2 1 + 1 )! 

4jt 


(5-48) 
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We then operate on this result with L + to obtain 


LjYt-i = -c/h e 1<? — + icot# — 


d\f e _i(/_1)< P d 


dtpJ \ sin 7 ” 1 # d(cos#) 


sin - '# 


= —CjflG 


2 ~—i(l—2)<p) 


—+ (/- 1) cot # 


sin /_1 #d(cos #) 


sin 2 '# 


= c,fi 2 e 1(1 2)<f> , 1 d sin 2 '# 
sm /_2 # d(cos #) 2 


After k such applications of L + to the function 7/ _/(#, (?), we have 

Z.*7,_, = d ' , sin 2 '# 

+ ’ 2 sm'-^edlcos#)* 

If we set A' = / + m in this expression, we obtain the desired result 

(4/+W 

L\ fm 7 / _ / = (—h) l+ ' n cie imq> sin"'#-— sin 2 '# (5.49) 

+ ’ d(cos#) /+m 

The general expression for 7/ m (#, <p) is obtained by substituting equation 
(5.49) into (5.47) with q given by equation (5.48) 


sin 2 '# (5.49) 


7/m(#, (P) = ^7 


(-l)' +m 7(2/ +!)(/ — iw)! 


-7^77-(je i "" p sin"'#--—— sin 2 '# 

4?r (/ + m)! d(cos #)'+"' 


(5.50) 

When 7/ m (#, (?) is decomposed into its two normalized factors according to 
equations (5.35) and (5.40), we have 


QUO) = 


(_l)/+ m /(2/ + !)(/ — m)\ . 


-^jsin"'# —-——sin 2 '# (5.51) 

2 (l + m)\ d(cos#) /+m 




(5.52) 


The spherical harmonics for / = 0, 1, 2, 3 are listed in Table 5.1. We note 
that the function 0/_„,(#) is related to ©/,„(#) by 

0/,-m(#) = ( — l) m 0/m(#) (5.53) 

and that the complex conjugate 7* m (#, <p) is related to 7/ m (0, (?) by 

Y*U0, <?) = (-lm.* (5.54) 

Because both Lr and L : are hermitian, the spherical harmonics 7/ m (#, (?) 
form an orthogonal set, so that 


r pjr 

Jo 


c?)7/ m (#, (?)sin#d#d(? = (5//’Ch 


(5.55) 
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Table 5.1. Spherical harmonics Yj m {6, cp) for l = 0, 1, 2, 3 


Too 


-(i) 


Tio = U- cos 9 


10 \ . 
v 4 JT 

Ti,±i = T 
T 20 — 

T 2 ,±i = T 
T2+2 


1/2 


1/2 


3 

8 tt 


1/2 


sin 


9 e ±i<p 


1/2 


16ji ) 
8k) 

= <—) 

32 Ji) 


(3 cos 2 6 — 1) 

1/2 


1/2 


sin 2 8 e ±2up 


= (jS 12 


\16kJ 

= Jl LI 


T30 

T 3 .ii = T (■ 

T3,i2 = 

T 3 ,i 3 = T 


(5 cos 3 0 — 3 cos 0) 

1/2 


\6AjtJ 
105 
32 71 


sin 0(5 cos 2 6 — l)e ±,ip 


1/2 


= x( 2 T) 

64 jt. 


sin 2 9 cos 9 e ±2l<p 

1/2 

sin 3 0 e ±3i ” 


sin 9 cos 9 e ±1<p 


If we introduce equation (5.35) into (5.55), we have 


MT 


e* m ,(0)@ Im (0)sm6d0 

0 


*2 71 

0 


dwd 


mm' 


The integral over the angle tp is 


•2ir 

®*A(p)®m((p)d(p 

0 


1 

2 n 


*2 Jt 

e -im'<p e im<p d(() 

0 


1 

2jz 


*2 Jt 

^i(m— m')(p 

0 


dtp 


where equation (5.52) has been introduced. Since m and m' are integers, this 
integral vanishes unless m = m\ so that 

*2 71 

0*,(<p)0 w (<p)d(p = d mm > (5.56) 

. 0 

from which it follows that 

0* BI (0)© /m (0) sin0d0 = <5»« (5.57) 

. 0 

Note that in equation (5.57) the same value for m appears in both ®* m (6) and 
0/,„($). Thus, the functions 0 i m (d ) and ®r m (d ) for / / /' are orthogonal, but 
the functions 0/ w (0) and are not orthogonal. However, for m 7 ^ w', 

the spherical harmonics T/,„(0, <p) and T/- m '(0, <p) are orthogonal because of 
equation (5.56). 
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Relationship of spherical harmonics to associated Legendre polynomials 

The functions ©i,„(0) and consequently the spherical harmonics Yi m (6, q>) are 
related to the associated Legendre polynomials, whose definition and properties 
are presented in Appendix E. To show this relationship, we make the substitu¬ 
tion of equation (5.42) for cos 6 in equation (5.51) and obtain 


0 — 


nr 

2>l\ 




( / + m)\ 


dp 


(5.58) 


Equation (E.13) relates the associated Legendre polynomial P™(pt) to the 
( / + m)th-order derivative in equation (5.58) 

r-w = f-p- f rr l2 ^;M-yy 

where / and m are positive integers (/, m 0) such that m =£ /. Thus, for 
positive m we have the relation 




(21 +!)(/— m)\ 


PT (cos 9), 


m 


0 


y 2 (/ + m)\ 1 

For negative m, we may write m = —\m\ and note that equation (5.53) states 


0 /HH (0) = (-ir0/, w (0) 


so that we have 


®i,-\ m \(d) = 

These two results may be combined as 

®im(0) = e 


1(21 + 1)(/ — 

m\)\ 

/ 2 (/+ | 

m\)\ 


Pj (cos 0) 


1(21 +!)(/ — 

m\)\ 

/ 2 (/+| 

m\)\ 


P'i( cos 0) 


where e = (—1)"' for m > 0 and e = 1 for m ^ 0. Accordingly, the spherical 
harmonics Y/ m (6, <p) are related to the associated Legendre polynomials by 


Yim(0, (p) = £ 

e = (~D 


l(2l+\)(l-\ 

m\)\ 

1 4 jt (l + | 

m\)\ 


Pj (cos 6)e 


i m(p 


m > 0 


(5.59) 


= 1, m ^ 0 

The eigenvalues and eigenfunctions of the orbital angular momentum 
operator L 2 may also be obtained by solving the differential equation 
Lrxp = Xfi 2 >p using the Frobenius or series solution method. The application of 
this method is presented in Appendix G and, of course, gives the same results 
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as the procedure using ladder operators. However, the Frobenius method may 
not be used to obtain the eigenvalues and eigenfunctions of the generalized 
angular momentum operators J 2 and ./- because their eigenfunctions do not 
have a spatial representation. 


5.4 The rigid rotor 


The motion of a rigid diatomic molecule serves as an application of the 
quantum-mechanical treatment of angular momentum to a chemical system. A 
rigid diatomic molecule consists of two particles of masses m\ and m 2 which 
rotate about their center of mass while keeping the distance between them fixed 
at a value R. Although a diatomic molecule also undergoes vibrational motion 
in which the interparticle distance oscillates about some equilibrium value, that 
type of motion is neglected in the model being considered here; the interparti¬ 
cle distance is frozen at its equilibrium value R. Such a rotating system is 
called a rigid rotor. 

We begin with a consideration of a classical particle i with mass in, rotating 
in a plane at a constant distance r, from a fixed center as shown in Figure 5.2. 
The time r for the particle to make a complete revolution on its circular path is 
equal to the distance traveled divided by its linear velocity Vj 


r = 


Ijirt 


(5.60) 


The reciprocal of r gives the number of cycles per unit time, which is the 
frequency v of the rotation. The velocity v, may then be expressed as 

Ijtfj 


v, = 


= 2jtvr: = COTi 


(5.61) 


where a> = 2jtv is the angular velocity. According to equation (5.1), the 
angular momentum L, of particle i is 

L i = r, X p, = m,(r,' X v, : ) (5.62) 


Since the linear velocity vector v, is perpendicular to the radius vector r„ the 
magnitude L, of the angular momentum is 



Figure 5.2 Motion of a rotating particle. 
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Li = m,riVj sin (jt/2) = nijrjVj = 0 ) m,r 2 (5.63) 

where equation (5.61) has been introduced. 

We next apply these classical relationships to the rigid diatomic molecule. 
Since the molecule is rotating freely about its center of mass, the potential 
energy is zero and the classical-mechanical Hamiltonian function H is just the 
kinetic energy of the two particles, 

H = ^ + ^ = \m x v\ + \m 2 v 2 2 (5.64) 

2 m\ 2ni2 

If we substitute equation (5.61) for each particle into (5.64) while noting that 
the angular velocity a> must be the same for both particles, we obtain 

H = \a> 2 (m\r\ + m 2 r 2 ) = \lor (5.65) 

where we have defined the moment of inertia I by 

/ = ni\ r 2 + m 2 r\ (5.66) 

In general, moments of inertia are determined relative to an axis of rotation. 
In this case the axis is perpendicular to the interparticle distance R and passes 
through the center of mass. Thus, we have 

r\ + r 2 = R 
and 


m\ r\ = m 2 r 2 


or, upon inversion 


r\ = 


r 2 = 


m 2 


m i + m 2 
m\ 


-R 


R 


(5.67) 


(5.68) 


(5.69) 


m i + m 2 

Substitution of equations (5.67) into (5.66) gives 

/ = fiR 2 

where the reduced mass u is defined by 

m i m 2 

H = - 

m | + m 2 

The total angular momentum L for the two-particle system is given by 

L = L\ + L 2 = a>(m\r\ + m 2 r\) = Ia> (5.70) 

where equations (5.63) and (5.66) are used. A comparison of equations (5.65) 
and (5.70) shows that 

L 2 


H = 


21 


(5.71) 
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Accordingly, the quantum-mechanical Hamiltonian operator H for this system 
is proportional to the square of the angular momentum operator 1 ? 

k = Yi 12 (5 - 72) 

Thus, the operators H and L 2 have the same eigenfunctions, namely, the 
spherical harmonics Y Jm (0, <p) as given in equation (5.50). It is customary in 
discussions of the rigid rotor to replace the quantum number / by the index J in 
the eigenfunctions and eigenvalues. 

The eigenvalues of H are obtained by noting that 

HYjJO, cp) = ^L 2 Y Jm {6, <p ) = J V+W 2 Yjm (d, <p) (5.73) 

where / is replaced by Jin equation (5.28a). Thus, the energy levels Ej for the 
rigid rotor are given by 

h 2 

Ej = J{J+\)— = J{J+\)B, J = 0,1,2,... (5.74) 

where B = fi 2 /2I is the rotational constant for the diatomic molecule. The 
energy levels Ej are shown in Figure 5.3. We observe that as J increases, the 
difference between successive levels also increases. 


Energy 
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J= T g4 — 9 


125 


J= 3, g 3 = ? 


65 


J= 2, g 2 = 5 
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0 


J= l.gi = 3 
J= 0, go = 1 


Figure 5.3 Energy levels of a rigid rotor. 
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To find the degeneracy of the eigenvalue Ej, we note that for a given value 
ofthe quantum number m has values m = 0, ±1, ±2, ..., ±J. Accordingly, 
there are (2 J + 1) spherical harmonics for each value of J and the energy level 
Ej is (2.7 + l)-fold degenerate. The ground-state energy level E 0 is non¬ 
degenerate. 


5.5 Magnetic moment 


Atoms are observed to have magnetic moments. To understand how an electron 
circulating about a nuclear core can give rise to a magnetic moment, we may 
apply classical theory. We consider an electron of mass m e and charge —e 
bound to a fixed nucleus of charge Ze by a central coulombic force F(r) with 
potential V(r) 


F{r) 


d V(r) -Ze 2 


V (r) 


dr 4jt£or 2 
-Ze 2 
4^reo r 


(5-75) 

(5.76) 


Equation (5.75) is Coulomb’s law for the force between two charged particles 
separated by a distance r. In SI units, the charge e is expressed in coulombs 
(C), while £o is the permittivity of free space with the value 
£o = 8.854 19 X 1(T 12 r'C 2 m- 1 


According to classical mechanics, a stable circular orbit of radius r and angular 
velocity to is established for the electron if the centrifugal force m e ra> 2 
balances the attractive coulombic force 


m e rco 2 


Ze 2 
4jt£q r 2 


This assumption is the basis of the Bohr model for the hydrogen-like atom. 
When solved for to, this balancing equation is 

f Ze 2 ' 1/2 


0 ) = 


\4jr£o m e r : 


(5-77) 


An electron in a circular orbit with an angular velocity a> passes each point 
in the orbit a>/2n times per second. This electronic motion constitutes an 
electric current /, defined as the amount of charge passing a given point per 
second, so that 


ea> 

In 


(5.78) 


From the definition of the magnetic moment in electrodynamics, a circulat- 
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ing current / enclosing a small area A gives rise to a magnetic moment M of 
magnitude M given by 

M=IA (5.79) 

The area A enclosed by the circular electronic orbit of radius r is Jir 2 . From 
equation (5.63) we have the relation L = m e cor 2 . Thus, the magnitude of the 
magnetic moment is related to the magnitude L of the angular momentum by 

M = ^~ ( 5 . 80 ) 

2 iHq 


The direction of the vector L is determined by equation (5.62). By convention, 
the direction of the current / is opposite to the direction of rotation of the 
negatively charged electron, i.e., opposite to the direction of the vector v. 
Consequently, the vector M points in the opposite direction from L (see Figure 
5.4) and equation (5.80) in vector form is 


M 


—e 
2 m e 


h~ 


(5.81) 


Since the units of L are those of h, we have defined in equation (5.81) the Bohr 
magneton /Ub as 

/r B = ~g~~ — 9.274 02 X 10~ 24 JT” 1 (5.82) 

2 m e 


The relationship (equation (5.81)) between M and L depends only on 
fundamental constants, the electronic mass and charge, and does not depend on 
any of the variables used in the derivation. Although this equation was obtained 
by applying classical theory to a circular orbit, it is more generally valid. It 
applies to elliptical orbits as well as to classical motion with attractive forces 
other than r~ 2 dependence. For any orbit in any central force field, the angular 



Figure 5.4 The magnetic moment M and the orbital angular momentum L of an 
electron in a circular orbit. 
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momentum is conserved and, since equation (5.81) applies, the magnetic 
moment is constant in both magnitude and direction. Moreover, equation (5.81) 
is also valid for orbital motion in quantum mechanics. 


Interaction with a magnetic field 

The potential energy V of an atom with a magnetic moment M in a magnetic 
field B is 

V = —M • B = —MB cos 0 (5.83) 

where 0 is the angle between M and B. The force F acting on the atom due to 
the magnetic field is 

F = -VV 


or 


dB BB 

F x = — M • —— = — M cos 6 —— 
dx dx 

dB BB 

F y = —M • —— = — AT cos 6—- 
By By 

9B BB 

F- = —M • — = —M cos 0 — 
Bz Bz 


(5.84) 


If the magnetic field is uniform, then the partial derivatives of B vanish and the 
force on the atom is zero. 

According to electrodynamics, the force F for a non-uniform magnetic field 
produces on the atom a torque T given by 


T = M X B = -^L X B 
h 


(5.85) 


where equation (5.81) has been introduced as well. From the relation T = 
dL/d t in equation (5.6), we have 


dL 

d/ 


-^L X B 
h 


(5.86) 


Thus, the torque changes the direction of the angular momentum vector L and 
the vector dL/d t is perpendicular to both L and B, as shown in Figure 5.5. As 
a result of this torque, the vector L precesses around the direction of the 
magnetic field B with a constant angular velocity a> l. This motion is known as 
Larmor precession and the angular velocity a>i is called the Larmor frequency. 
Since the magnetic moment M is antiparallel to the angular moment L, it also 
precesses about the magnetic field vector B. 

From equation (5.61), the Larmor angular frequency or velocity a> L is equal 
to the velocity of the end of the vector L divided by the radius of the circular 
path shown in Figure 5.5 
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dL 

dt 


Figure 5.5 The motion in a magnetic field B of the orbital angular momentum vector L. 


__ jdL/d t\ 

L L sin 9 

The magnitude of the vector dL/d t is obtained from equation (5.86) as 


dL 

dt 


~h 


LB sin 9 


so that 


0) L 


Bb B 
~h~ 


(5.87) 


If we take the z-axis of the coordinate system parallel to the magnetic field 
vector B, then the projection of L on B is L z and cos 9 in equation (5.83) is 

L z 

cos 6 = — 

In quantum mechanics, the only allowed values of L are \/'l(! + 1) h with 
/ = 0, 1, ... and the only allowed values of L : are mh with m = 0, ±1, ..., 
±1. Accordingly, the angle 9 is quantized, being restricted to values for which 
m 

cos 9 = , / = 0,1,2,..., m = 0, ±1, ..., ±1 (5.88) 

v 1(1 + 1) 


The possible orientations of L with respect to B for the case 1 = 3 are 
illustrated in Figure 5.6. Classically, all values between 0 and jt are allowed for 
the angle 9. When equations (5.81) and (5.88) are substituted into (5.83), we 
find that the potential energy V is also quantized 

V = mfiB B, m = 0, ±1, ..., ±1 


(5.89) 


Problems 


155 



Figure 5.6 Possible orientations in a magnetic field B of the orbital angular momentum 
vector L for the case 1 — 3 


Problems 

5.1 Show that each of the operators L x , L y , L- is hermitian. 

5.2 Evaluate the following commutators: 

(a) [L x , x ] (b) [L x , p x ] (c) [L x , y ] (d) [L x , p y ] 

5.3 Using the commutation relation (5.10b), find the expectation value of L x for a 
system in state | Im). 

5.4 Apply the uncertainty principle to the operators L x and L y to obtain an expres¬ 
sion for AL x AL y . Evaluate the expression for a system in state | Ini). 

5.5 Show that the operator .7 2 commutes with J x and with 

5.6 Show that J + and ./ as defined by equations (5.18) are adjoints of each other. 

5.7 Prove the relationships (5.19a)-(5.19g). 

5.8 Show that the choice for c in equation (5.24) is consistent with c + in equation 
(5.22). 

5.9 Using the raising and lowering operators J + and ./ , show that 

(jm\j x \jm) - (jm\J y \jm ) - 0 

5.10 Show that 

(Jm\J 2 x \jm) = (Jm\J 2 y \jm) - )jj(j + l) - m 2 ]h 2 

5.11 Show that [/, m) are eigenfunctions of [J x , J ] and of \J y , J . ]. Find the 
eigenvalues of each of these commutators. 
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The hydrogen atom 


A theoretical understanding of the structure and behavior of the hydrogen atom 
is essential to the fields of physics and chemistry. As the simplest atomic 
system, hydrogen must be understood before one can proceed to the treatment 
of more complex atoms, molecules, and atomic and molecular aggregates. The 
hydrogen atom is one of the few examples for which the Schrodinger equation 
can be solved exactly to obtain its wave functions and energy levels. The 
resulting agreement between theoretically derived and experimental quantities 
serves as confirmation of the applicability of quantum mechanics to a real 
chemical system. Further, the results of the quantum-mechanical treatment of 
atomic hydrogen are often used as the basis for approximate treatments of more 
complex atoms and molecules, for which the Schrodinger equation cannot be 
solved. 

The study of the hydrogen atom also played an important role in the 
development of quantum theory. The Lyman, Balmer, and Paschen series of 
spectral lines observed in incandescent atomic hydrogen were found to obey 
the empirical equation 



where v is the frequency of a spectral line, c is the speed of light, n\ = 1, 2, 3 
for the Lyman, Balmer, and Paschen series, respectively, n 2 is an integer 
determining the various lines in a given series, and R is the so-called Rydberg 
constant, which has the same value for each of the series. Neither the existence 
of these spectral lines nor the formula which describes them could be explained 
by classical theory. In 1913, N. Bohr postulated that the electron in a hydrogen 
atom revolves about the nucleus in a circular orbit with an angular momentum 
that is quantized. He then applied Newtonian mechanics to the electronic 
motion and obtained quantized energy levels and quantized orbital radii. From 
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the Planck relation A E = E, h — E„ 2 = hv, Bohr was able to reproduce the 
experimental spectral lines and obtain a theoretical value for the Rydberg 
constant that agrees exactly with the experimentally determined value. Further 
investigations, however, showed that the Bohr model is not an accurate 
representation of the hydrogen-atom structure, even though it gives the correct 
formula for the energy levels, and led eventually to Schrodinger’s wave mech¬ 
anics. Schrodinger also used the hydrogen atom to illustrate his new theory. 


6.1 Two-particle problem 

In order to apply quantum-mechanical theory to the hydrogen atom, we first 
need to find the appropriate Hamiltonian operator and Schrodinger equation. 
As preparation for establishing the Hamiltonian operator, we consider a 
classical system of two interacting point particles with masses m \ and m 2 and 
instantaneous positions ri and r 2 as shown in Figure 6.1. In terms of their 
cartesian components, these position vectors are 

ri = ixi + jyi + kzi 


r 2 = ix 2 + jy 2 + kz 2 

The vector distance between the particles is designated by r 

r = r 2 - iq = ix +jy + kz (6.1) 

where 

X = X 2 -X 1 , y = y 2 -yi, z = z 2 - z\ 

The center of mass of the two-particle system is located by the vector R with 
cartesian components, X, Y, Z 



Figure 6.1 The center of mass (CM) of a two-particle system. 
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R = iX + ]Y + kZ 

By definition, the center of mass is related to ri and r 2 by 

m\V\ + 1712*2 


R 


M 


( 6 . 2 ) 


where M = ni\ + w 2 is the total mass of the system. We may express ri and r 2 
in terms of R and r using equations (6.1) and (6.2) 


m 2 

ri = R-r 

M 

„ m i 

r 2 = R d-- r 

M 


(6.3) 


If we restrict our interest to systems for which the potential energy V is a 
function only of the relative position vector r, then the classical Hamiltonian 
function H is given by 


H = 


iPi 


IP2I 


+ V(r) 


(6.4) 


Pi = m 


2 m 1 2 m 2 

where the momenta pi and p 2 for the two particles are 

dri dr 2 

dT* P2 = W2 df 

These momenta may be expressed in terms of the time derivatives of R and r 
by substitution of equation (6.3) 

^R m 2 dr^ 
d t M d t. 


Pi = m 


dR m 1 dr 

P2 = '” ! ld7 + «dt 


(6.5) 


Substitution of equation (6.5) into (6.4) yields 


H = \M 


dR 

d t 


+ \}i 


dr 

At 


+ V(r) 


( 6 . 6 ) 


where the cross terms have canceled out and we have defined the reduced mass 
M by 

m\ m 2 m\ m 2 

/u = -= ——— (6.7) 

m 1 + m 2 M 

The momenta p« and p r , corresponding to the center of mass position R and 
the relative position variable r, respectively, may be defined as 

dr 

p ; . = fi- 


dR 

p ^M-, 


dr 


In terms of these momenta, the classical Hamiltonian becomes 
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H = 

2 M 



f m 


( 6 . 8 ) 


We see that the kinetic energy contribution to the Hamiltonian is the sum of 
two parts, the kinetic energy due to the translational motion of the center of 
mass of the system as a whole and the kinetic energy due to the relative motion 
of the two particles. Since the potential energy V(r) is assumed to be a function 
only of the relative position coordinate r, the motion of the center of mass of 
the system is unaffected by the potential energy. 

The quantum-mechanical Hamiltonian operator H is obtained by replacing 
|p«| 2 and |p, | 2 in equation (6.8) by the operators — fr 2 V| and —h 2 V 2 r , respec¬ 
tively, where 


2 < 9 2 d 2 d 2 

R = dX 2 + 8Y 2 + dZ 2 

2 d 2 d 2 d 2 

V~ = -1-1- 

' dx 2 dy 2 dz 2 


(6.9a) 

(6.9b) 


The resulting Schrodinger equation is, then, 


ft 2 

2M 


v 2 * 



+ V (r) 


^(R, r) 


£W(R, r) 


( 6 . 10 ) 


This partial differential equation may be readily separated by writing the 
wave function V P(R, r) as the product of two functions, one a function only of 
the center of mass variables X, Y, Z and the other a function only of the relative 
coordinates x, y, z 

W(R, r) = x(X, Y, Z)y>(x, y, z) = yJRytplr) 

With this substitution, equation (6.10) separates into two independent partial 
differential equations 

~Ym v2r%(R) = Er%{R) (6 - U) 

- y V 2 M r ) + y ( r )V ; ( r ) = ErV’(v) (6.12) 

where 


E — Er + E r 

Equation (6.11) is the Schrodinger equation for the translational motion of a 
free particle of mass M, while equation (6.12) is the Schrodinger equation for a 
hypothetical particle of mass u moving in a potential field V(r). Since the 
energy E R of the translational motion is a positive constant (E R 3= 0), the 
solutions of equation (6.11) are not relevant to the structure of the two-particle 
system and we do not consider this equation any further. 
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6.2 The hydrogen-like atom 


The Schrodinger equation (6.12) for the relative motion of a two-particle 
system is applicable to the hydrogen-like atom, which consists of a nucleus of 
charge +Ze and an electron of charge —e. The differential equation applies to 
H for Z = 1, He + for Z = 2, Li 2+ for Z = 3, and so forth. The potential energy 
V(r) of the interaction between the nucleus and the electron is a function of 
their separation distance r = |r| = (x 2 + y 2 + z 2 ) 1 / 2 and is given by Coulomb’s 
law (equation (5.76)), which in SI units is 


V(r) 


Ze 2 


where meter is the unit of length, joule is the unit of energy, coulomb is the 
unit of charge, and £o is the permittivity of free space. Another system of units, 
used often in the older literature and occasionally in recent literature, is the 
CGS gaussian system, in which Coulomb’s law is written as 

Ze 2 

V (r) = - 


r 

In this system, centimeter is the unit of length, erg is the unit of energy, and 
statcoulomb (also called the electrostatic unit or esu) is the unit of charge. In 
this book we accommodate both systems of units and write Coulomb’s law in 
the form 


Ze' 2 

V(r) =-- (6.13) 

r 

where e' = e for CGS units or <?' = e/(4jT.£ 0 ) 1//2 for SI units. 

Equation (6.12) cannot be solved analytically when expressed in the 
cartesian coordinates x, y, z, but can be solved when expressed in spherical 
polar coordinates r, 6, <p, by means of the transformation equations (5.29). The 
laplacian operator V 2 in spherical polar coordinates is given by equation (A.61) 
and may be obtained by substituting equations (5.30) into (6.9b) to yield 

1 d f . d \ 1 d 2 

sin 6 d6 \ m dOJ ^ sin 2 6 dtp 2 


1 d 


8 


Vf = — — r z — + — 
' r 2 dr \ dr 


If this expression is compared with equation (5.32), we see that 


_LA ( r 2 ^\ _ —L 

r 2 dr\ dr) h 2 r 2 


where L 2 is the square of the orbital angular momentum operator. With the 
laplacian operator V 2 expressed in spherical polar coordinates, the Schrodinger 
equation ( 6 . 12 ) becomes 



6.3 The radial equation 
Hrp(r, 0, cp) = Eip(r, 6, cp) 
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with 


k = ~J~ 2 J~ ( r2 §-) + y~^ L 2+v ^ ( 6 - 14 ) 

2/ur z or \ orj 2 f.ir A 

The operator L 2 in equation (5.32) commutes with the Hamiltonian operator 
H in (6.14) because I? commutes with itself and does not involve the variable 
r. Likewise, the operator L z in equation (5.31c) commutes with H because it 
commutes with L 2 as shown in (5.15a) and also does not involve the variable r. 
Thus, we have 

[H, L 2 ] = 0, [H, L z ] = 0, [ L 2 , L z \ = 0 

and the operators H, L 2 , and L z have simultaneous eigenfunctions, 

6 , cp) = Eip(r, 0, cp) (6.15a) 

L 2 ip(r, 0, cp) = /(/ + \)fi 2 xp{r, d, cp), l = 0, 1, 2, ... (6.15b) 

L z tp(r, 6, cp) = mhip(r, 6, (p), m = —/,—/+ 1 , ...,/— 1 , / (6.15c) 

The simultaneous eigenfunctions of L 2 and L z are the spherical harmonics 
Yi m (6, cp) given by equations (5.50) and (5.59). Since neither L 2 nor L z involve 
the variable r, any specific spherical harmonic may be multiplied by an 
arbitrary function of r and the result is still an eigenfunction. Thus, we may 
write Tp(r, 6, cp) as 

ip(r, d, tp) = R(r)Y/ m (6, cp) (6.16) 


Substitution of equations (6.13), (6.14), (6.15b), and (6.16) into (6.15a) gives 

H,R(r) = ER(r) (6.17) 

where 


H, = - 


fi 2 

2 /ur 2 


57 + 


Ze 


1 2 


and where the common factor Yi m (9, cp) has been divided out. 


(6.18) 


6.3 The radial equation 

Our next task is to solve the radial equation (6.17) to obtain the radial function 
R{ r) and the energy E. The many solutions of the differential equation (6.17) 
depend not only on the value of /, but also on the value of E. Therefore, the 
solutions are designated as 7?£/(r). Since the potential energy —Ze' 2 /r is 
always negative, we are interested in solutions with negative total energy, i.e., 
where E =£ 0. It is customary to require that the functions R/:i(t') be normal- 
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ized. Since the radial part of the volume element in spherical coordinates is 
r 2 dr, the normalization criterion is 


[R E ,(r)] 2 r 2 dr = 1 


Through an explicit integration by parts, we can show that 


(6.19) 


RFj(r)[HiR F 'i(r)]r 2 dr = 


R E 'l(r)[HiR FJ (r)]r 2 dr 


Thus, the operator Hi is hermitian and the radial functions R E i(r) constitute an 
orthonormal set with a weighting function w(r) equal to r 2 


Rm(r)R E 'i(r)r 2 dr = 6 


EE' 


( 6 . 20 ) 


where d EE ’ is the Kronecker delta and equation (6.19) has been included. 
We next make the following conventional change of variables 

pZe' 2 


A = 


h(-2pE) 1 / 2 


_ 2(-2 pE) l / 2 r _ 2pZe' 2 r 
9 ~ Ji “ Ail 2 = 
where a fl = fi 2 /lie' 2 . We also make the substitution 

'2 Z\ 3/2 . 

\A 

Equations (6.17) and (6.18) now take the form 


2 Zr 
Aa f , 


Rei(t) = -— S)j(p) 


f W- + 2p ^ ^ + ^ Xp " t) Su ~ /(/ + l)Su 


( 6 . 21 ) 

( 6 . 22 ) 

(6.23) 

(6.24) 


where the first term has been expanded and the entire expression has been 
multiplied by p 2 . 

To be a suitable wave function, S)j(p) must be well-behaved, i.e., it must be 
continuous, single-valued, and quadratically integrable. Thus, pS>j vanishes 
when p —► oo because Su must vanish sufficiently fast. Since S)j is finite 
everywhere, pS^i also vanishes at p = 0. Substitution of equations (6.22) and 
(6.23) into (6.19) shows that Snip) is normalized with a weighting function 
w(p) equal to p 2 

*oo 

[SMfp 2 dp= 1 (6.25) 

. o 

Equation (6.24) may be solved by the Frobenius or series solution method as 
presented in Appendix G. However, in this chapter we employ the newer 
procedure using ladder operators. 
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Ladder operators 

We now solve equation (6.24) by means of ladder operators, analogous to the 
method used in Chapter 4 for the harmonic oscillator and in Chapter 5 for the 
angular momentum . 1 We define the operators Ax and B> as 

A x = -p^--^ + X-\ (6.26a) 

dp 2 

Bx = p^-^ + X (6.26b) 

We now show that the operator A ; is the adjoint of B A and vice versa. Thus, 

neither A ; nor B-, is hermitian. For any arbitrary well-behaved functions f(p) 

and g(p), we consider the integral 

f 00 f 00 dg f 00 / p \ 

f(.p)[Ag(p)]dp = - fp—dp+ f[-- + ^-l)gdp 
Jo Jo dp Jo V 2 J 

where (6.26a) has been used. Integration by parts of the first term on the right- 

hand side with the realization that the integrated part vanishes yields 

r°° „ f 00 d r 00 / p \ 

fAxgdp= g — (p/) dp + /(—^ + 2—llgdp 

Jo Jo dp Jo V 2 J 


=.,”K4“f +A ) /d ' > 


Substitution of (6.26b) gives 


f(p)[Aig(p)]dp= g(p)[Bxf(p)]dp 


showing that, according to equation (3.33) 

a\ = Bx, B'x = Ax 

We readily observe from (6.26a) and (6.26b) that 

~ - d 2 d o 2 

B x Ax = ~P 2 ^~2p— -2p + — + 2(2- 1) 

A x Bx = -p 2 ^ “ 2 P J~ p ~ “ 1 ">P + + 2(2 “ ! ) 

Equation (6.24) can then be written in the form 

BxAxSxi = [2(2 -!)-/(/ +l)\Sx, 


(6.27) 

(6.28a) 

(6.28b) 

(6.29) 


showing that the functions Sxi(p) are also eigenfunctions of B;Ax- From 
equation (6.28b) we obtain 


1 We follow here the treatment by D. D. Fitts (1995) J. Chem. Educ. 72, 1066. However, the definitions of the 
lowering operator and the constants an and bxt have been changed. 
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AxhSx-u = [A(A - 1) - /(/ + 1 M_ u (6.30) 

when X is replaced by X — 1 in equation (6.24). 

If we operate on both sides of equation (6.29) with the operator Ax, we 
obtain 

AxBxAxSx, = [A(A - I) - Id + 1 )JAxSxi (6.31) 

Comparison of this result with equation (6.30) leads to the conclusion that 
AxSxi and Sx-ij are, except for a multiplicative constant, the same function. 
We implicitly assume here that Sxi is uniquely determined by only two 
parameters, X and /. Accordingly, we may write 

AxSxi = axiSx-i,i (6.32) 

where axi is a numerical constant, dependent in general on the values of X and 
/, to be determined by the requirement that Sx\ and Sx-\j be normalized. 
Without loss of generality, we can take axi to be real. The function AxSxi is an 
eigenfunction of the operator in equation (6.24) with eigenvalue decreased by 
one. Thus, the operator Ax transforms the eigenfunction Sxi determined by X, 1 
into the eigenfunction Sx~ij determined by X — 1, /. For this reason the 
operator Ax is a lowering ladder operator. 

Following an analogous procedure, we now operate on both sides of equation 
(6.30) with the operator B, to obtain 

BxAxBxSx-u = [A(A - 1) - Id + 1 )]BxSx-u (6.33) 

Comparing equations (6.29) and (6.33) shows that B,Sx-\j and Sxi are 
proportional to each other 

BxSx-\j = bxiSxi (6.34) 

where bxi is the proportionality constant, assumed real, to be determined by the 
requirement that Sx~ij and Sxi be normalized. The operator B; transforms the 
eigenfunction Sx~ij into the eigenfunction Sxi with eigenvalue X increased by 
one. Accordingly, the operator Bx is a raising ladder operator. 

The next step is to evaluate the numerical constants axi and bxi- In order to 
accomplish these evaluations, we must first investigate some mathematical 
properties of the eigenfunctions Sxi(p). 

Orthonormal properties of Sxi(p) 

Although the functions R„i(r) according to equation (6.20) form an orthogonal 
set with w(r) = r 2 , the orthogonal relationships do not apply to the set of 
functions Sxi(p) with w(p) = p 2 . Since the variable p introduced in equation 
(6.22) depends not only on r, but also on the eigenvalue E, or equivalently on 
X, the situation is more complex. To determine the proper orthogonal relation¬ 
ships for Sxi(p), we express equation (6.24) in the form 
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H'iSu = -XS, 


where H\ is defined by 


H', = p 


u 


d 2 | 2 d p /(/+!) 

dp 2 dp 4 p 


(6.35) 


(6.36) 


By means of integration by parts, we can readily show that this operator H) is 
hermitian for a weighting function vv(p) equal to p, thereby implying the 
orthogonal relationships 


SMS r ,(p)pdp = 0 for X^X' 


(6-37) 


In order to complete the characterization of integrals of Sxi(p), we need to 
consider the case where X = X' for w(p) = p. Recall that the functions Su(p) 
are normalized for w(p) = p 2 as expressed in equation (6.25). The same result 
does not apply for w(p) = p. We begin by expressing the desired integral in a 
slightly different form 


[Sn(p)] 2 P dp = \ 


[^/(p)] 2 d(p 2 ) 


Integration of the right-hand side by parts gives 


[S„(p)] 2 pdp = l 


p 2 [SMf 


p 2 Su 


dp 


dp 


If Snip) is well-behaved, the integrated term vanishes. From equation (6.26a) 
we may write 

so that 

= -A)Su - \pS u + {X - 1)5)./ 

= — auSx-u — \pSxi + (X — 1 ) 5 ^/ 

where equation (6.32) has been introduced. The integral then takes the form 


[Sxi(p)fp dp = a u 


SxiSx-ijP dp + ^ 


[Sx,fp 2 dp 


- a -1) 


[Sufpdp 


Since the first integral on the right-hand side vanishes according to equation 
(6.37) and the second integral equals unity according to (6.25), the result is 
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[SM] 2 pdp = - 

Combining equation (6.38) with (6.37), we obtain 

f 00 1 

Su(p)Svi(p)p dp = 2^ow 


(6.38) 


(6.39) 


Evaluation of the constants an and bn 

To evaluate the numerical constant an, which is defined in equation (6.32), we 
square both sides of (6.32), multiply through by p, and integrate with respect to 
p to obtain 


•OO pOO 

p(A)Sn)(AiSn) dp = a\, (Sx-ufp dp 
Jo Jo 


(6.40) 


Application of equation (6.27) with / = pA x Sn and g = Sn to the left-hand 
side and substitution of equation (6.38) on the right-hand side give 


•OO 

SkiBfpAiSn) dp = a 2 u /2(X — 1 ) 

. o 

The expression B x (pA x Sn) may be simplified as follows 

BfpAiSki) = p — ( pAk.Sk /) + ^ + 2 -^ pAkSn 

= pA\Sn + p" ~r~ (A-kSki) + p (— k. + A x S X i 


(6.41) 


= pAkSxi + pBxAxSn 

= paxiSk-\,i + [MX — 1) — /(/ + 1 )]P‘S'a/ 

where equations (6.26b), (6.32), and (6.29) have been used. When this result is 
substituted back into (6.41), we have 

*oo poo 

an SnSx-uP dp + [X(X - 1) - /(/ + 1)] S 2 u p dp = 4/2(2 - 1) (6.42) 
. o Jo 

According to equation (6.39), the first integral vanishes and the second integral 

equals (22) _1 , giving the result 

4 = f^W-!)-/(/+!)] 


X - 1 


(X + l)(X - l - 1 ) 


(6.43) 


Substitution into (6.32) gives 
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f/;_i\ ]!/2 

A X S U = y (^—J (A + /)(2 - l - 1)J Sx- 1,1 (6.44) 

where we have arbitrarily taken the positive square root. 

The numerical constant bn, defined in equation (6.34), may be determined 
by an analogous procedure, beginning with the square of both sides of equation 
(6.34) and using equations (6.27), (6.26a), (6.34), (6.30), and (6.39). We obtain 

bh = ( I 4y) [2(2 -!)-/(/+1)] = W - 1 - !) ( 6 - 45 ) 

so that equation (6.34) becomes 

r / 2 \ 1 1/2 

BiSi-u = y (j —-J (2 + 0(2 - / - 1)J S X J (6.46) 

Taking the positive square root here will turn out to be consistent with the 
choice in equation (6.44). 

Quantization of the energy 

The parameter X is positive, since otherwise the radial variable p, which is 
inversely proportional to 2, would be negative. Furthermore, the parameter X 
cannot be zero if the transformations in equations (6.21), (6.22), and (6.23) are 
to remain valid. To find further restrictions on X we must consider separately 
the cases where 1 = 0 and where / 5 s 1. 

For / = 0, equation (6.44) takes the form 

AxSxo = (2 - l)Sx-i,o (6.47) 

Suppose we begin with a suitably large value of 2, say §, and continually apply 
the lowering operator to both sides of equation (6.47) with 2 = § 

Ag-iAgSgo = (B — 1)(£ — 2) < S'|_2,o 
h-2-h -tMo = (£ - 1)(4 - m - 3)^-3,0 

Eventually this procedure produces an eigenfunction S^k.o, A'being a positive 
integer, such that 0 <(£ — A) =£ 1. The next step in the sequence would give a 
function S^k-\,o or Sx o with X = (| — A — 1) =£ 0, which is not allowed. Thus, 
the sequence must terminate with the condition 

A^-kS^-k, 0 = (£-&- 1 )^— 1,0 = 0 

which can only occur if (§ — k) = 1. Thus, § must be an integer and the 
minimum value of X for / = 0 is 2 = 1. 

For the situations in which / 3= 1, we note that the quantities a\ t in equation 



168 


The hydrogen atom 

(6.43) and h\ A in equation (6.45), being squares of real numbers, must be 
positive. Consequently, the factor (2 — / — 1) must be positive, so that 
A 2 s (/ + 1). 

We now select some appropriately large value § of the parameter A in 
equation (6.44) and continually apply the lowering operator to both sides of the 
equation in the same manner as in the l = 0 case. Eventually we obtain Sg-kj 
such that (/+1)*S(£— k)< (1 + 2). The next step in the sequence would 
give St-k-ij or Sn with A = (£ — k — 1) < (7 + 1), which is not allowed, so 
that the sequence must be terminated according to 

A%-kS%-k,l = a Z-k,lS%-k-\,l 

" ft _ k — 1 \ 1 

= %_ k J -k-l- 1 ) S^k-u 

= 0 


for some value of k. Thus, £ must be an integer for a^-kj to vanish. As k 
increases during the sequence, the constant at_ kl vanishes when 
k = (g — l — 1) or ( 5 — k) = (7 + 1). The minimum value of A is then / + 1. 

Combining the conclusions of both cases, we see that the minimum value of 
A is / + 1 for / = 0, 1, 2, ... Beginning with the value A = / + 1, we can apply 
equation (6.46) to yield an infinite progression of eigenfunctions S n i(p) for each 
value of / (1 = 0, 1 , 2, ...), where A can take on only integral values, 
A = n = l + 1, 1 + 2, 1 + 3, ... Since § in both cases was chosen arbitrarily 
and was shown to be an integer, equation (6.46) generates all of the eigenfunc¬ 
tions S)j(p) for each value of l. There are no eigenfunctions corresponding to 
non-integral values of A. Since A is now shown to be an integer n, in the 
remainder of this presentation we replace A by n. 

Solving equation (6.21) for the energy E and replacing A by n, we obtain the 
quantized energy levels for the hydrogen-like atom 


f.iZ 2 e' A _ ZV 
2 h 2 n 2 2 a /( « 2 ’ 


(6.48) 


These energy levels agree with the values obtained in the earlier Bohr theory. 

Electronic energies are often expressed in the unit electron volt (eV). An 
electron volt is defined as the kinetic energy of an electron accelerated through 
a potential difference of 1 volt. Thus, we have 


1 eV = (1.602 177 X 10 ~ 19 C) X (1.000 000 V) = 1.602 177 X 10 ~ 19 J 


The ground-state energy E\ of a hydrogen atom (Z = 1) as given by equation 
(6.48) is 


E 1 = -2.178 68 X 10 ~ 18 J = — 


13.598 eV 
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This is the energy required to remove the electron from the ground state of a 
hydrogen atom to a state of zero kinetic energy at infinity and is also known as 
the ionization potential of the hydrogen atom. 


Determination of the eigenfunctions 

Equation (6.47) may be used to obtain the ground state (n = 1, / = 0) eigen¬ 
function £i 0 (p). Introducing the definition of A n in equation (6.26a), we have 

A l s 10 = -^+^s 10 = o 


or 

d£io = £io 
dp 2 

from which it follows that 

£ 10 = cc - p ' 2 = 2 - 1 / 2 e-^ 2 

where the constant c of integration was evaluated by applying equations (6.25), 
(A.26), and (A.28). 

The series of eigenfunctions .S' 20 , £ 30 , • • • are readily obtained from equations 
(6.46) and (6.26b) with X = n, l = 0 

Bn^n— 1,0 hp 2 nSn0 


Thus, S 20 is 


^=K^H 2_,/vp/2 


2\Jl 


(2 - p)e 


and £30 is 


~ _ 1 ( d p 
530 “ 3 ( p dP~2 + 3 


2^2 


(2 — p)e 


-p /2 


= 6 A ( 6-6p + p 2 ,e-^ 

and so forth ad infinitum. Each eigenfunction is normalized. 

The eigenfunctions for / > 0 are determined in a similar manner. A general 
formula for the eigenfunction £/ +1/ , which is the starting function for evaluat¬ 
ing the series £„/ with fixed /, is obtained from equations (6.44) and (6.26a) 
with /=«=/+! 



The hydrogen atom 

Ai+\Si + \j = — ^p — + l^Si+ij = 0 


dSi +u 


= { , - 2 S ^ 


Integration gives 

S l+U = [(2/+ 2)!]- 1 /ye“ p/2 (6.49) 

where the integration constant was evaluated using equations (6.25), (A.26), 
and (A.28). 

The eigenfunction S 2 i from equation (6.49) is 

Sll= 2 V^ Pe ^ 

and equations (6.46) and (6.26b) for / = 1 give 

o 1 („ d P , ^ 1 „-*/2 


531 y/6\ P dp 2 + 3 j2V6 pe '~ 

= Y^( 4 - P)P q P/2 

5 41 = \/77dP-j--S + 4 )TT (4-P)pe 


40\ dp 2 


8730 


(20 — lOp + p )pe 


2 W"P / 2 


The functions S 31 , S 4 1 , ... are automatically normalized as specified by 
equation (6.25). The normalized eigenfunctions S„i{p) for l = 2, 3, 4, ... with 
« 3= (/ + 1 ) are obtained by the same procedure. 

A general formula for S„i involves the repeated application of Bk for 
k = / + 2, / + 3, ..., n — 1, n to Sz+i,/ in equation (6.49). The raising operator 
must be applied (n — l — 1) times. The result is 

S„i = ... (b 1+2,1) l B n B n -1 ... Bi + 2 Si+ij 

(l + 1)(2/ +1)! ] l/2 ( ±_P \ 

" «(«+/)!(« -/- 1 ) 1 ( 2 /+ 2 )lJ \ dp 2 7 

X ••• ( P ^“ f +/ + 2 ) p/e "' /2 (6 ’ 50) 

Just as equation (6.46) can be used to go ‘up the ladder’ to obtain S n j from 
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S n -\j, equation (6.44) allows one to go ‘down the ladder’ and obtain S„-\j 
from S„i. Taking the positive square root in going from equation (6.43) to 

(6.44) is consistent with taking the positive square root in going from equation 

(6.45) to (6.46); the signs of the functions S„i are maintained in the raising and 
lowering operations. In all cases the ladder operators yield normalized eigen¬ 
functions if the starting eigenfunction is normalized. 

The radial factors of the hydrogen-like atom total wave functions f(r, 6 , <p) 
are related to the functions S„i(p) by equation (6.23). Thus, we have 

/7\ 3 / 2 

R l0 = 2 —) e” p / 2 


R20 — 
R30 = 


— (— 
2 \/2 \ ti fi 

J_/Z 
9\/3 \a fl 


3/2 

(2 - p)e~ p/2 


3/2 

(6 — 6 p + p 2 )e~ p ! 2 


Ri\ 

i?31 

i?41 


1 /Z\ 3/2 


2y/6\ 


aj 


1 /Z\ 3/2 


aj 


9\/6 V 


32/15 \a fl 


pe p I 1 


(4 ~ P)pe p/1 

3/2 

(20 - lOp + p 2 )pe~ p/2 


and so forth. 

A more extensive listing appears in Table 6.1. 


Radial functions in terms of associated Laguerre polynomials 
The radial functions S n /(p) and R n i(r) may be expressed in terms of the 
associated Laguerre polynomials L J k (p), whose definition and mathematical 
properties are discussed in Appendix F. One method for establishing the 
relationship between S„i(p) and L[(p) is to relate S n /(p) in equation (6.50) to 
the polynomial L J k (p) in equation (F.15). That process, however, is long and 
tedious. Instead, we show that both quantities are solutions of the same 
differential equation. 
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Table 6.1. Radial functions R n ifor the hydrogen-like atom for 
n = 1 to 6. The variable p is given by p = 2Zrjna u 

Rio = 2 (Z/ a/l f' 2 e- p/2 


R 20 — 


Rn — 


(Z/af 3/2 

2f2 

(Z/af 3/2 

2\j6 


(2 - P)e- p/2 

pe~ p / 2 


R30 

R31 


R32 


(Z/ aff' 2 
9\f7> 

(Z/affl 2 

9\f0 

{Z/arf 12 

9s/30 


(6 -6 p + p 2 )e p ! 2 
(4 - p)p Q- p > 2 
p 2 e~ p l 2 


R 40 = (Z/ q£ )V2 (24 - 36 p + 12p 2 - p 3 )e- p/2 
96 

{Z/a u ) 3 1 2 , 

e - p / 2 

96\/35 


-^50 — 


/?51 = 


^52 — 
-^53 = 


^54 — 


(^/^) 3/2 

300^ 

(^Aq <) 3/2 

150\/30 

(^/^) 3/2 

150^ 

(^/^) 3/2 

300 v / 70 

(ZK ) 3/2 

900 s/70 


(120 - 240p + 120p 2 - 20p 3 + p 4 )e^ p/2 
(120 - 90p + 18p 2 - p 3 )p e~ p/2 
(42 — 14p + p 2 )p 2 e~ p,/2 

(8 - p)p 3 e~^ 2 


p 4 e 


p/2 


R60 


(Z/a ) 3 / 2 

' (720 - 1800p + 1200p 2 - 300p 3 + 30p 4 - p 5 )e~ p/2 

2160y6 
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Table 6.1. ( cont .) 


*61 

*62 

*63 

*64 

*65 


432V5I5 (840 - 840p + 252p 2 - 28p 3 + p 4 )p e~ p/2 

(Z/a,) 3 ' 2 , 


r(336 — 168p + 24p — p )p e 


3 3n 2 P -P/2 


864\/l05 
(Z/a ' l)3 Jl (72- lSp + pVe ^/ 2 


2592+15 
(Z/a,,) 3 ' 2 
12 960+7 


(10 — p)p 4 e p ! 2 


(Z/ ^ ) 3 / 2 . p 5 c - g/2 


12 960 +77 


We observe that the solutions +/(p) of the differential equation (6.24) 
contain the factor p l e~ p F. Therefore, we define the function F„i(p) by 

S„,(p) = F nl (p)p l ^ p/2 


and substitute this expression into equation (6.24) with a = n to obtain 
d 2 F i d F i 

p—f + (21 + 2-p)—^ + (n-l-l)F nl = 0 (6.51) 

ap z dp 

where we have also divided the equation by the common factor p. 

The differential equation satisfied by the associated Laguerre polynomials is 
given by equation (F.16) as 

d 2 /- 7 c\l J 

^ +(J+l - p) A +(t - j>Li = 0 

If we let k = n + / and j = 21 + 1, then this equation takes the form 


2 7-2/+1 


d 2 T 


72+/ 


dp 2 


d* 2/+1 

T (2 / + 2 — p )—- i - nr 2/ +* - 


dp 


+ (n~l- 1++7 =0 


(6.52) 


We have already found that the set of functions S n i(p) contains all the 
solutions to (6.24). Therefore, a comparison of equations (6.51) and (6.52) 
shows that F„i is proportional to L 2 / 4 , 1 . Thus, the function S n i(p) is related to 
the polynomial L 2 ^ 1 (p) by 

S nl (p) = cmp'e-rPlffiip) (6.53) 


The proportionality constants c„i in equation (6.53) are determined by the 
normalization condition (6.25). When equation (6.53) is substituted into (6.25), 
we have 
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p 2,+ ' e -< i [L 2 n , +;(p)] 2 dp = \ 


Jo 

The value of the integral is given by equation (F.25) with a 
j = 21 + 1 , so that 


n + / and 


2 2n[{n + /)!] 3 _ 

Cnl (n - l - 1)! 

and S„i(p) in equation (6.53) becomes 

SM = - (S) (6-54) 

Taking the negative square root maintains the sign of S„i(p). 

Equations (6.39) and (F.22), with S„i(p) and L J k (p) related by (6.54), are 
identical. From equations (F.23) and (F.24), we find 

r°o, )o ,, 2 , i /(«-/)(»+/+1 ) 

Snl(p)S n ±lAp)P dp = ~h - - -—- 

Jo V »(«+!) 

*oo 

S n i(p)Sn',i(p)p 2 dp = 0, n' ^ n,n ± 1 
Jo 

The normalized radial functions R n /(r) may be expressed in terms of the 
associated Laguerre polynomials by combining equations (6.22), (6.23), and 
(6.54) 


Rnl(r) 


4(n - l — 1)!Z 3 /2 Zr\ 
n 4 [(n + l)l] 3 a 3 fl \na^J 


& - Zr lna« L 2l+\ (2Zr l na!i) 


(6.55) 


Solution for positive energies 

There are also solutions to the radial differential equation (6.17) for positive 
values of the energy E, which correspond to the ionization of the hydrogen-like 
atom. In the limit r oo, equations (6.17) and (6.18) for positive E become 


d 2 R(r) 
dr 2 


2/uE 
h 2 


m 


0 


for which the solution is 


R(r) = ce ±Vf*E) l/2 r/h 

where c is a constant of integration. This solution has oscillatory behavior at 
infinity and leads to an acceptable, well-behaved eigenfunction of equation 
(6.17) for all positive eigenvalues E. Thus, the radial equation (6.17) has a 
continuous range of positive eigenvalues as well as the discrete set (equation 
(6.48)) of negative eigenvalues. The corresponding eigenfunctions represent 
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unbound or scattering states and are useful in the study of electron-ion 
collisions and scattering phenomena. In view of the complexity of the analysis 
for obtaining the eigenfunctions and eigenvalues of equation (6.17) for positive 
E and the unimportance of these quantities in most problems of chemical 
interest, we do not consider this case any further. 

Infinite nuclear mass 

The energy levels E n and the radial functions R„i(r) depend on the reduced 
mass p of the two-particle system 

m^me m c 

11 = -=- 

m N + m e m c 

m N 

where /% is the nuclear mass and m e is the electronic mass. The value of m e is 
9.10939 X 10 31 kg. For hydrogen, the nuclear mass is the protonic mass, 
1.672 62 X 1(T 27 kg, so that u is 9.1044 X 10 -31 kg. For heavier hydrogen-like 
atoms, the nuclear mass is, of course, greater than the protonic mass. In the 
limit —> oo, the reduced mass and the electronic mass are the same. In the 
classical two-particle problem of Section 6.1, this limit corresponds to the 
nucleus remaining at a fixed point in space. 

In most applications, the reduced mass is sufficiently close in value to the 
electronic mass m c that it is customary to replace u in the expressions for the 
energy levels and wave functions by m e . The parameter a u = ft 2 /ue' 2 is 
thereby replaced by ao = fi 2 /m e e' 2 . The quantity u 0 is, according to the earlier 
Bohr theory, the radius of the circular orbit of the electron in the ground state 
of the hydrogen atom (Z = 1) with a stationary nucleus. Except in Section 6.5, 
where this substitution is not appropriate, we replace u by m e and a„ by u 0 in 
the remainder of this book. 


6.4 Atomic orbitals 

We have shown that the simultaneous eigenfunctions ip{r, 6, <p) of the opera¬ 
tors H, L 2 , and L z have the form 

Vnimir, 9, < p) = film) = R„i(r)Yi m (6, cp) (6.56) 

where for convenience we have introduced the Dirac notation. The radial 
functions R„i(r) and the spherical harmonics T/ m (0, <p) are listed in Tables 6.1 
and 5.1, respectively. These eigenfunctions depend on the three quantum 
numbers n, /, and m. The integer n is called the principal or total quantum 
number and determines the energy of the atom. The azimuthal quantum 
number l determines the total angular momentum of the electron, while the 
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magnetic quantum number m determines the z-component of the angular 
momentum. We have found that the allowed values of n, l, and m are 

m = 0, ±1, ±2, ... 

/ = \m\, \m\ + 1, \m\ 4- 2, ... 
n = / -F 1, / 4- 2, l 4- 3, ... 

This set of relationships may be inverted to give 
n = 1, 2, 3, ... 

/ = 0, 1, 2, ..., n — 1 

m = — /, —/ + 1, ..., —1, 0, 1, ..., I — 1, / 

These eigenfunctions form an orthonormal set, so that 

(n' V m'\nlm) = d nn 'dn'd mm ' 

The energy levels of the hydrogen-like atom depend only on the principal 
quantum number n and are given by equation (6.48), with a u replaced by no, as 

Z 2 e' 2 

E„ = -~ -n = l, 2, 3, ... (6.57) 

2 aon- 

To find the degeneracy g n of E„, we note that for a specific value of n there are 
n different values of /. For each value of l, there are (21 + 1) different values of 
m, giving (21 + 1) eigenfunctions. Thus, the number of wave functions corre¬ 
sponding to n is given by 

n —1 n —1 n —1 

g„ = ^(2 /+l) = 2^/+^l 
1 =0 1=0 1=0 

The first summation on the right-hand side is the sum of integers from 0 to 
(n — 1) and is equal to n(n — l)/2 (n terms multiplied by the average value of 
each term). The second summation on the right-hand side has n terms, each 
equal to unity. Thus, we obtain 

g n = n(n - 1) + n = n 2 

showing that each energy level is « 2 -fold degenerate. The ground-state energy 
level E i is non-degenerate. 

The wave functions | nlm) for the hydrogen-like atom are often called atomic 
orbitals. It is customary to indicate the values 0, 1,2, 3, 4, 5, 6, 7, ... of the 
azimuthal quantum number / by the letters s, p, d, f, g, h, i, k, ..., respectively. 
Thus, the ground-state wave function 1100) is called the Is atomic orbital, 
|200) is called the 2s orbital, |210), |211), and |21 —1) are called 2p orbitals, 
and so forth. The first four letters, standing for sharp , principal, diffuse, and 
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fundamental, originate from an outdated description of spectral lines. The 
letters which follow are in alphabetical order with j omitted. 


s orbitals 

The 1 s atomic orbital | Is) is 

1 f Z\ 3//2 

| Is) = 1 100 ) = Rio(r)Y 00 (d, <p) = ( - J e~ Zr / a ° (6.58) 

where Rio(r) and 7 0 o(0, <p) are obtained from Tables 6.1 and 5.1. Likewise, the 
orbital | 2 s) is 

|2s> = |200,= ^t ,6 ' 59) 

and so forth for higher values of the quantum number n. The expressions for 
|«s) for n = 1, 2, and 3 are listed in Table 6.2. 

All the s orbitals have the spherical harmonic Y 00 (6, <p) as a factor. This 
spherical harmonic is independent of the angles 0 and tp, having a value 
(2 v / 7r)~ 1 . Thus, the s orbitals depend only on the radial variable r and are 
spherically symmetric about the origin. Likewise, the electronic probability 
density \xp\ 2 is spherically symmetric for s orbitals. 


p orbitals 

The wave functions for n = 2, / = 1 obtained from equation (6.56) are as 
follows: 

|2po) = |210) — ( Z / a o) _ rQ -Zr/2a 0 cos q (6.60a) 

4 v 2jt 

1 / 7 \ 5 / 2 

|2 Pl ) = |211) = —— — fQ-Zr/Zao sin @ e i<p (6.60b) 

8 \a o J 

1 / z \ 5 / 2 

12 p \) = |21 -1) = —— — re~ Zr ! 2a ° smOe^ (6.60c) 

oJT'l- \a o J 

The 2s and 2p 0 orbitals are real, but the 2pi and 2p | orbitals are complex. 
Since the four orbitals have the same eigenvalue E 2 , any linear combination of 
them also satisfies the Schrodinger equation (6.12) with eigenvalue E 2 . Thus, 
we may replace the two complex orbitals by the following linear combinations 
to obtain two new real orbitals 
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Table 6.2. Real wave functions for the hydrogen-like atom. The parameter a fl 

has been replaced by ao 


3s 

3p z 

3px 

3p v 

3d z 2 

3d xz 


3d 


7 Z 


State 


Wave function 


Spherical coordinates 

Cartesian coordinates 

Is 

(Z/a 0 f 2 Zr/ao 
v« 


2s 

< Z /««) 3/2 t, & i e -2r/2. 

4v^ V ao>/ 


2p z 

(Z/aof 2 re - Zr/2ao cosg 

4v2jt 

(Z/« 0 ) 5/2 z r / 2a „ 

4v^ 

2p A 

(Z/aff 2 re _zz /2ao sin 0 CQS 
4\phi 

(Z/affl 2 ^ c -Zr/2 ai> 

4 flit 

2Pv 

(Z/aof- re -2r /2a „ sin0sin(? , 
4\/2n 

( Z /a 0 ) 5/2 -Zr/2a 0 

4s/2jt y 


(Z/ fl0 ) 3/2 


Zr Z 2 r 2 
27- 18-b 2—z— 


- Zr / 2> ao 


81\/3jr y «o afi j 

81 \flii \ a o ) 

U-*), . e -^.. smecosip 

81 \/ 2 h \ a 0 ) 

e -z./,.. singsin 

81 \/ 2 jr V «o/ 

(Z/aoifl 2 3 cos 2 g-i) 

81 >/ 6 jr 

2 (Z/flo)_ — Zr/ 3 a 0 s j n g CQS g CQS 

81 \/ 2 jr 


2(Z/fl 0 ) 7 /- ^2 e -Zr/3 
81\/27r 


sin 6 cos 6 sin (p 


3d (Z/a °iL /2 r 2 Q- Zr /3ao &m 2 eC Os2<p 
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i /z \ 5 / 2 

| 2 px> = 2 _ 1 / 2 (| 2 Pl ) + | 2 p_i)) = A ^ l7i y /2 (^—J re~ Zr/2a ° sinflcos cp (6.61a) 

1 / 7 \ 5 / 2 

|2 P v) = -i2' 1/2 (|2 py) - | 2 p_!>) = 4(2 ^ 1/2 J re~ Zr/2a ° sinOsintp 

(6.61b) 

where equations (A.32) and (A.33) have been used. These new orbitals |2p r ) 
and | 2 p v ) are orthogonal to each other and to all the other eigenfunctions 
| nlm). The factor 2 ~ l/2 ensures that they are normalized as well. Although 
these new orbitals are simultaneous eigenfunctions of the Hamiltonian operator 
H and of the operator L 2 , they are not eigenfunctions of the operator L z . 

If we now substitute equations (5.29a), (5.29b), and (5.29c) into (6.61a), 
(6.61b), and (6.60a), respectively, we obtain for the set of three real 2 p orbitals 

' 2 p '>=4(2^© 5/2 «' a ' ,2 “" (6 ' 62a) 

^ = (6 ' 62b) 

(M2c) 

The subscript x, y, or z on a 2p orbital indicates that the angular part of the 
orbital has its maximum value along that axis. Graphs of the square of the 
angular part of these three functions are presented in Figure 6.2. The mathema¬ 
tical expressions for the real 2p and 3p atomic orbitals are given in Table 6.2. 


d orbitals 

The five wave functions for n = 3, / = 2 are 

1 / z \ 7 / 2 

|3do) = |320) =-— — r 2 e -(Zr/3a 0 )(3 cos 2 6 - 1) (6.63a) 

81V67T \CtoJ 

1 / 7 \ 7 / 2 

13d-ti) = |32 ± 1) = -——( — ) 7' 2 e _<z '’ ,/3ao) sin0cos0e ±1 ' ?> (6.63b) 

oXyJjt \aoJ 

1 / 7 \ 7 / 2 

|3d ±2 > = |32 ± 2) = —--= ( — ) r 2 e - (Zr/3ao) sin 2 0e ±i2<? (6.63c) 

162 a Jji \ao J 

The orbital |3do) is real. Substitution of equation (5.29c) into (6.63a) and a 
change in notation for the subscript give 
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Figure 6.2 Polar graphs of the hydrogen 2p atomic orbitals. Regions of positive and 
negative values of the orbitals are indicated by + and — signs, respectively. The 
distance of the curve from the origin is proportional to the square of the angular part 
of the atomic orbital. 


|3d z 2> 


1 

81 y/6jr 



r 2 )e _(Zr / 3a °) 


(6.64a) 


From the four complex orbitals |3di), |3d_i), |3d2), and |3d_2)> we construct 
four equivalent real orbitals by the relations 


21/ 2 / 7 \ 7 / 2 

|3d, z ) = 2 _1 / 2 (|3d 1 ) + |3d_!» = - 2 (-j xzQ~ (Zr ^ aa) (6.64b) 

21/2 / y\ 7/ 2 

|3d yz ) = - |3d_0) = yze-^/^ (6.64c) 
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|3d,_,.> . 2- 1 /*(|3d 2 > + |3d_2» = (|) ? V - 

(6.64d) 

01/2 / y\ 7/2 

|3d xv ) = —i2 _1/2 (|3d 2 ) - |3d_ 2 >) = xye~ (Z '' /3ao) (6.64e) 

In forming |3d Y 2_ v a) and |3d A - v ), equations (A.37) and (A.38) were used. Graphs 
of the square of the angular part of these five real functions are shown in Figure 
6.3 and the mathematical expressions are listed in Table 6.2. 


Radial functions and expectation values 

The radial functions R„i(r) for the Is, 2s, 2p, 3s, 3p, and 3d atomic orbitals are 
shown in Figure 6.4. For states with If 0, the radial functions vanish at the 
origin. For states with no angular momentum (/ = 0), however, the radial 
function R n o(r ) has a non-zero value at the origin. The function R„i(r) has 
(« — l — 1) nodes between 0 and oo, i.e., the function crosses the r-axis 
(n — l — 1) times, not counting the origin. 

The probability of finding the electron in the hydrogen-like atom, with the 
distance r from the nucleus between r and r + dr, with angle 9 between 9 and 
0 + AO, and with the angle tp between tp and <p + dtp is 

\fnim \ 2 dr = \Rni(r)] 2 \Y lm (d, cp)\ 2 r 2 sin6 dr A6 dtp 

To find the probability D„i(r) dr that the electron is between r and r + dr 
regardless of the direction, we integrate over the angles 9 and tp to obtain 


D n i(r) dr 


r 2 [R„,(r)] 2 dr 


*2 71 

| Yi m (9, tp)\ 2 sin 9 d 9 dtp 
o 


r 2 \Rn,(r)] 2 dr 

(6.65) 


Since the spherical harmonics are normalized, the value of the double integral 
is unity. 

The radial distribution function D ni ( r) is the probability density for the 
electron being in a spherical shell with inner radius r and outer radius r + dr. 
For the Is, 2s, and 2p states, these functions are 
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Figure 6.3 Polar graphs of the hydrogen 3d atomic orbitals. Regions of positive and 
negative values of the orbitals are indicated by + and — signs, respectively. The 
distance of the curve from the origin is proportional to the square of the angular part 
of the atomic orbital. 
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D w (r) = 4(—) r 2 e~ 2Zr/ao 
\ao ) 

Diuir) = (-) r 2 ( 2- — 

8 \a 0 J \ a 0 


1 


Zr/ a 0 


( 6 . 66 ) 


Higher-order functions are readily determined from Table 6.1. The radial 
distribution functions for the Is, 2s, 2p, 3s, 3p, and 3d states are shown in 
Figure 6.5. 

The most probable value r mp of r for the Is state is found by setting the 
derivative of Ao(/) equal to zero 

3 


d A pH ) 
dr 


= 8 ( —^ A l - — ) e - 2Zr / fl0 =0 


do] 


Zr 


fl 0 


which gives 

f'mp = do/Z (6.67) 

Thus, for the hydrogen atom (Z = 1) the most probable distance of the electron 
from the nucleus is equal to the radius of the first Bohr orbit. 

The radial distribution functions may be used to calculate expectation values 
of functions of the radial variable r. For example, the average distance of the 
electron from the nucleus for the 1 s state is given by 


(r)ls = 


■o° / z \ 3 

rD\o(r)dr = 4 — 
o \doJ 


r 3 e 2Zr / a « d r = (6.68) 


where equations (A.26) and (A.28) were used to evaluate the integral. By the 
same method, we find 


6a 0 

V)2^ = —, 


5ao 

\n 2p = 


The expectation values of powers and inverse powers of r for any arbitrary 
state of the hydrogen-like atom are defined by 


(r k )n,= 


r k D n /(r) dr = 


r K [Rni(r)fr 2 dr 


(6.69) 


In Appendix H we show that these expectation values obey the recurrence 
relation 


-—y-{r k )ni ~ (2& + 1)°^ {r k ~ x ) „i + k 


1(1 + 1) T 


1 - k 2 


“0 / k—1\ n 

^ 2 V )nl-V 
(6.70) 
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For k 


For k 


or 


For k 


0, equation (6.70) gives 


( r X )m = 


1, equation (6.70) gives 


4(r)„-^+/(/ + l)4( r - , >„ = 0 


M,/ = H [3 " 2 -'('+!)] 


2, equation (6.70) gives 
3 . 9 . 5 do 


^2 - -^r( r )nl + 2 [/(/ + 1 ) — — 0 


(6.71) 


(6.72) 


or 

(r 2 >„/ = ^[5« 2 -3/(/+l)+l] 

For higher values of A', equation (6.70) leads to ( r 3 ),,/, ( r4 )»/> ■ • ■ 
For k = — 1, equation (6.70) relates (r _3 )„/ to (r -2 ),,/ 




(6.73) 


(6.74) 


For A' = —2, —3, ..., equation (6.70) gives successively (. r~ A ) nh ( r~ 5 ) nh ... 
expressed in terms of (r _2 )„/. 

Although the expectation value ( r~ 2 )„i cannot be obtained from equation 
(6.70), it can be evaluated by regarding the azimuthal quantum number / as the 
parameter in the Hellmann-Feynman theorem (equation (3.71)). Thus, we 
have 


8E n _ /8H,\ 
01 \dl / 


(6.75) 


where the Hamiltonian operator Hi is given by equation (6.18) and the energy 
levels E„ by equation (6.57). The derivative dHi/dl is just 


d Hi 

dl 


ft 2 

2 fir 2 


( 2 /+ 1 ) 


(6.76) 


In the derivation of (6.57), the quantum number n is shown to be the value of / 
plus a positive integer. Accordingly, we have dn/dl = 1 and 


dE n ZV 2 d _2 ZV 2 dn d _ 2 Z 2 h 2 

-=- n — - ti — - fi 

dl 2ao dl 2 ciq dl dn /.ial 


(6.77) 


where a fl = h 2 1 fie' 1 has been replaced by a 0 = h 2 /m e e' 2 . Substitution of 
equations (6.76) and (6.77) into (6.75) gives the desired result 
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( r 2 )m = 


n 3 (l + \)al 


(6.78) 


Expression (6.71) for the expectation value of r _1 may be used to calculate 
the average potential energy of the electron in the state | nlm). The potential 
energy V(r) is given by equation (6.13). Its expectation value is 

(V) nl = —Ze' 2 (r~ l ) nl = — ——y (6.79) 

ci() n 


The result depends only on the principal quantum number n, so we may drop 
the subscript /. A comparison with equation (6.57) shows that the total energy 
is equal to one-half of the average potential energy 

E n = \(V)n (6.80) 

Since the total energy is the sum of the kinetic energy T and the potential 
energy V, we also have the expression 

Z 2 e' 2 

T n = —E n = - -j (6.81) 

2cion- 

The relationship E n = — T„ = (V n /2) is an example of the quantum-mechani¬ 
cal vtrial theorem. 


6.5 Spectra 


The theoretical results for the hydrogen-like atom may be related to experimen¬ 
tally measured spectra. Observed spectral lines arise from transitions of the 
atom from one electronic energy level to another. The frequency v of any given 
spectral line is given by the Planck relation 

v = (E 2 — Ei)/h 


where E\ is the lower energy level and E 2 the higher one. In an absorption 
spectrum, the atom absorbs a photon of frequency v and undergoes a transition 
from a lower to a higher energy level (E\ —> E 2 ). In an emission spectrum, the 
process is reversed; the transition is from a higher to a lower energy level 
( E 2 — Ei) and a photon is emitted. A spectral line is usually expressed as a 
wave number v, defined as the reciprocal of the wavelength X 


_ _ 1 v 

X c 


\E 2 - Ei 
he 


(6.82) 


The hydrogen-like atomic energy levels are given in equation (6.48). If n\ and 
n 2 are the principal quantum numbers of the energy levels E\ and E 2 , 
respectively, then the wave number of the spectral line is 



188 


The hydrogen atom 


Table 6.3. Rydberg constant for 
hydrogen-like atoms 


Atom 

R (cm ') 

>H 

109 677.58 

2 h(D) 

109 707.42 

4 He+ 

109 722.26 

7 Li 2 + 

109 728.72 

9 Be 3+ 

109 730.62 

oo 

109 737.31 


v = 



where the Rydberg constant R is given by 


R 


fie 


14 


4jrfi 3 c 


»2 > «1 


(6.83) 


(6.84) 


The value of the Rydberg constant varies from one hydrogen-like atom to 
another because the reduced mass fi is a factor. It is not appropriate here to 
replace the reduced mass fi by the electronic mass m c because the errors 
caused by this substitution are larger than the uncertainties in the experimental 
data. The measured values of the Rydberg constants for the atoms *14, 4 He + , 
7 Li 2+ , and 9 Be 3+ are listed in Table 6.3. Following the custom of the field of 
spectroscopy, we express the wave numbers in the unit cnR 1 rather than the SI 
unit m 1 . Also listed in Table 6.3 is the extrapolated value of R for infinite 
nuclear mass. The calculated values from equation (6.84) are in agreement 
with the experimental values within the known number of significant figures 
for the fundamental constants m e , e', and h and the nuclear masses n?y. The 
measured values of R have more significant figures than any of the quantities 
in equation (6.84) except the speed of light c. 

The spectrum of hydrogen (Z = 1) is divided into a number of series of 
spectral lines, each series having a particular value for n\. As many as six 
different series have been observed: 


= 1, 

Lyman series 

ultraviolet 

= 2, 

Balmer series 

visible 

= 3, 

Paschen series 

infrared 

= 4, 

Brackett series 

infrared 

= 5, 

Pfund series 

far infrared 


— 6, Humphreys series very far infrared 
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series 

Figure 6.6 Energy levels for the hydrogen atom. 

Thus, transitions from the lowest energy level n\ = 1 to the higher energy 
levels n 2 = 2, 3, 4, ... give the Lyman series, transitions from n { =2 to 
«2 = 3, 4, 5, ... give the Balmer series, and so forth. An energy level diagram 
for the hydrogen atom is shown in Figure 6.6. The transitions corresponding to 
the spectral lines in the various series are shown as vertical lines between the 
energy levels. 
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Figure 6.7 A typical series of spectral lines for a hydrogen-like atom shown in terms of 
the wave number v. 


A typical series of spectral lines is shown schematically in Figure 6.7. The 
line at the lowest value of the wave number v corresponds to the transition 
n\ —► («2 = »i + 1), the next line to n\ —> («2 = «i + 2), and so forth. These 
spectral lines are situated closer and closer together as n 2 increases and 
converge to the series limit, corresponding to n 2 = oo. According to equation 
(6.83), the series limit is given by 

v = R /n\ (6.85) 

Beyond the series limit is a continuous spectrum corresponding to transitions 
from the energy level n\ to the continuous range of positive energies for the 
atom. 

The reduced mass of the hydrogen isotope 2 F1, known as deuterium, slightly 
differs from that of ordinary hydrogen 1 H. Accordingly, the Rydberg constants 
for hydrogen and for deuterium differ slightly as well. Since naturally occurring 
hydrogen contains about 0.02% deuterium, each observed spectral line in 
hydrogen is actually a doublet of closely spaced lines, the one for deuterium 
much weaker in intensity than the other. This effect of nuclear mass on spectral 
lines was used by Urey (1932) to prove the existence of deuterium. 


Pseudo-Zeeman effect 

The influence of an external magnetic field on the spectrum of an atom is 
known as the Zeeman effect. The magnetic field interacts with the magnetic 
moments within the atom and causes the atomic spectral lines to split into a 
number of closely spaced lines. In addition to a magnetic moment due to its 
orbital motion, an electron also possesses a magnetic moment due to an 
intrinsic angular momentum called spin. The concept of spin is discussed in 
Chapter 7. In the discussion here, we consider only the interaction of the 
external magnetic field with the magnetic moment due to the electronic orbital 
motion and neglect the effects of electron spin. Thus, the following analysis 
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does not give results that correspond to actual observations. For this reason, we 
refer to this treatment as the pseudo-Zeeman effect. 

When a magnetic field B is applied to a hydrogen-like atom with magnetic 
moment M, the resulting potential energy V is given by the classical expression 

F = -M-B = ^L-B (6.86) 

n 

where equation (5.81) has been introduced. If the z-axis is selected to be 
parallel to the vector B, then we have 

V = p&BL z /ii (6.87) 


If we replace the z-component of the classical angular momentum in equation 
(6.87) by its quantum-mechanical operator, then the Hamiltonian operator H B 
for the hydrogen-like atom in a magnetic field B becomes 


H b = H + ^-L 
n 


( 6 . 88 ) 


where H is the Hamiltonian operator (6.14) for the atom in the absence of the 
magnetic field. Since the atomic orbitals ip„i m in equation (6.56) are simultan¬ 
eous eigenfunctions of H, L 2 , and L z , they are also eigenfunctions of the 
operator H B . Accordingly, we have 


IlnfnUn =\H+^-l 


V nim — ( E n nut\\ B) If' n l, n 


(6.89) 


where E„ is given by (6.48) and equation (6.15c) has been used. Thus, the 
energy levels of a hydrogen-like atom in an external magnetic field depend on 
the quantum numbers n and m and are given by 


F — — 

J - / nm — 


zV 2 

2 a /t n 2 


+ nijU R B, 


n = 1, 2, 


m = 0, ±1, 


±{n - 1) 
(6.90) 


This dependence on m is the reason why m is called the magnetic quantum 
number. 

The degenerate energy levels for the hydrogen atom in the absence of an 
external magnetic field are split by the magnetic field into a series of closely 
spaced levels, some of which are non-degenerate while others are still 
degenerate. For example, the energy level Z 3 for n = 3 is nine-fold degenerate 
in the absence of a magnetic field. In the magnetic field, this energy level is 
split into five levels: Z 3 (triply degenerate), £ 3 + u R B (doubly degenerate), 
Z 3 — u r B (doubly degenerate), Z 3 + 2u R B (non-degenerate), and Z 3 — 2« B B 
(non-degenerate). Energy levels for s orbitals (/ = 0) are not affected by the 
application of the magnetic field. Energies for p orbitals (7 = 1) are split by the 
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magnetic field into three levels. For d orbitals (7 = 2), the energies are split into 
five levels. 

This splitting of the energy levels by the magnetic field leads to the splitting 
of the lines in the atomic spectrum. The wave number v of the spectral line 
corresponding to a transition between the state \n\l\m\) and the state \n 2 hm 2 ) 
is 

v = = RZ 2 f\- - nii), n 2 > «i (6.91) 

he \n\ n^J he 

Transitions between states are subject to certain restrictions called selection 
rules. The conservation of angular momentum and the parity of the spherical 
harmonics limit transitions for hydrogen-like atoms to those for which 
A/ = ±1 and for which Am = 0, ±1. Thus, an observed spectral line v 0 in the 
absence of the magnetic field, given by equation (6.83), is split into three lines 
with wave numbers v 0 + (/.i b B/ he), v 0 , and v 0 — (ju b B/ he). 


6.1 

6.2 

6.3 

6.4 

6.5 

6.6 

6.7 

6.8 

6.9 


6.10 

6.11 


Problems 


Obtain equations (6.28) from equations (6.26). 

Evaluate the commutator [Ax, Bx\ where the operators A) and B) are those in 
equations (6.26). 

Show explicitly by means of integration by parts that the operator Hi in equation 
(6.18) is hermitian for a weighting function equal to r 2 . 

Demonstrate by means of integration by parts that the operator H\ in equation 
(6.36) is hermitian for a weighting function w(p) — p. 

Show that (Ax + l)Sx+i,i = ax+ijSxi and that (B A + \)S 2 i = bx+ijSx+i.i- 
Derive equation (6.45) from equation (6.34). 

Derive the relationship 


fl-nl 


SniSn—\,iP dp b n -\-\j 


n-\-\,lP dp — 1 


JO Jo 

Evaluate (r ')«/ for the hydrogen-like atom using the properties of associated 
Laguerre polynomials. First substitute equations (6.22) and (6.55) into (6.69) for 
k — — 1. Then apply equations (F.22) to obtain (6.71). 

From equation (F.19) with v — 2, show that 


„2/+3 c 


[L 2 n , +l(p)fdp 


2[3n 2 - /(/ +!)][(«+ /)!] 3 


to (n — l — 1)! 

Then show that (/')„/ is given by equation (6.72). 

Show that ( r) 2s = 6no / Z using the appropriate radial distribution function in 
equations (6.66). 

Set 2 = e' in the Hcllmann -Feynman theorem (3.71) to obtain (r ')„/ for the 
hydrogen-like atom. Note that a 0 depends on e'. 
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6.12 Show explicitly for a hydrogen atom in the Is state that the total energy E\ is 
equal to one-half the expectation value of the potential energy of interaction 
between the electron and the nucleus. This result is an example of the quantum- 
mechanical virial theorem. 

6.13 Calculate the frequency, wavelength, and wave number for the series limit of the 
Balmer series of the hydrogen-atom spectral lines. 

6.14 The atomic spectrum of singly ionized helium He + with n\ — 4, n 2 = 5,6,... is 
known as the Pickering series. Calculate the energy differences, wave numbers, 
and wavelengths for the first three lines in this spectrum and for the series limit. 

6.15 Calculate the frequency, wavelength, and wave number of the radiation emitted 
from an electronic transition from the third to the first electronic level of Li 2+ . 
Calculate the ionization potential of Li 2+ in electron volts. 

6.16 Derive an expression in terms of for the difference in wavelength, 
AA — /.h — Ad, between the first line of the Balmer series (n\ — 2) for a 
hydrogen atom and the corresponding line for a deuterium atom? Assume that 
the masses of the proton and the neutron are the same. 
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Spin 


7.1 Electron spin 

In our development of quantum mechanics to this point, the behavior of a 
particle, usually an electron, is governed by a wave function that is dependent 
only on the cartesian coordinates x, y, z or, equivalently, on the spherical 
coordinates r, 6, cp. There are, however, experimental observations that cannot 
be explained by a wave function which depends on cartesian coordinates alone. 

In a quantum-mechanical treatment of an alkali metal atom, the lone valence 
electron may be considered as moving in the combined field of the nucleus and 
the core electrons. In contrast to the hydrogen-like atom, the energy levels of 
this valence electron are found to depend on both the principal and the 
azimuthal quantum numbers. The experimental spectral line pattern corre¬ 
sponding to transitions between these energy levels, although more complex 
than the pattern for the hydrogen-like atom, is readily explained. However, in a 
highly resolved spectrum, an additional complexity is observed; most of the 
spectral lines are actually composed of two lines with nearly identical wave 
numbers. In an alkaline-earth metal atom, which has two valence electrons, 
many of the lines in a highly resolved spectrum are split into three closely 
spaced lines. The spectral lines for the hydrogen atom, as discussed in Section 
6.5, are again observed to be composed of several very closely spaced lines, 
with equation (6.83) giving the average wave number of each grouping. The 
splitting of the spectral lines in the alkali and alkaline-earth metal atoms and in 
hydrogen cannot be explained in terms of the quantum-mechanical postulates 
that are presented in Section 3.7, i.e., they cannot be explained in terms of a 
wave function that is dependent only on cartesian coordinates. 

G. E. Uhlenbeck and S. Goudsmit (1925) explained the splitting of atomic 
spectral lines by postulating that the electron possesses an intrinsic angular 
momentum, which is called spin. The component of the spin angular momen- 
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turn in any direction has only the value h/2 or —h/2. This spin angular 
momentum is in addition to the orbital angular momentum of the electronic 
motion about the nucleus. They further assumed that the spin imparts to the 
electron a magnetic moment of magnitude eh/2m e , where —e and m c are the 
electronic charge and mass. The interaction of an electron’s magnetic moment 
with its orbital motion accounts for the splitting of the spectral lines in the 
alkali and alkaline-earth metal atoms. A combination of spin and relativistic 
effects is needed to explain the fine structure of the hydrogen-atom spectrum. 

The concept of spin as introduced by Uhlenbeck and Goudsmit may also be 
applied to the Stern-Gerlach experiment, which is described in detail in 
Section 1.7. The explanation for the splitting of the beam of silver atoms into 
two separate beams by the external inhomogeneous magnetic field requires the 
introduction of an additional parameter to describe the behavior of the odd 
electron. Thus, the magnetic moment of the silver atom is attributed to the odd 
electron possessing an intrinsic angular momentum which can have one of only 
two distinct values. 

Following the hypothesis of electron spin by Uhlenbeck and Goudsmit, P. A. 
M. Dirac (1928) developed a quantum mechanics based on the theory of 
relativity rather than on Newtonian mechanics and applied it to the electron. 
He found that the spin angular momentum and the spin magnetic moment of 
the electron are obtained automatically from the solution of his relativistic 
wave equation without any further postulates. Thus, spin angular momentum is 
an intrinsic property of an electron (and of other elementary particles as well) 
just as are the charge and rest mass. 

In classical mechanics, a sphere moving under the influence of a central 
force has two types of angular momentum, orbital and spin. Orbital angular 
momentum is associated with the motion of the center of mass of the sphere 
about the origin of the central force. Spin angular momentum refers to the 
motion of the sphere about an axis through its center of mass. It is tempting to 
apply the same interpretation to the motion of an electron and regard the spin 
as the angular momentum associated with the electron revolving on its axis. 
However, as Dirac’s relativistic quantum theory shows, the spin angular 
momentum is an intrinsic property of the electron, not a property arising from 
any kind of motion. The electron is a structureless point particle, incapable of 
‘spinning’ on an axis. In this regard, the term ‘spin’ in quantum mechanics can 
be misleading, but its use is well-established and universal. 

Prior to Dirac’s relativistic quantum theory, W. Pauli (1927) showed how spin 
could be incorporated into non-relativistic quantum mechanics. Since the 
subject of relativistic quantum mechanics is beyond the scope of this book, we 
present in this chapter Pauli’s modification of the wave-function description so 
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as to include spin. His treatment is equivalent to Dirac’s relativistic theory in 
the limit of small electron velocities (v/c —> 0). 


7.2 Spin angular momentum 

The postulates of quantum mechanics discussed in Section 3.7 are incomplete. 
In order to explain certain experimental observations, Uhlenbeck and Goudsmit 
introduced the concept of spin angular momentum for the electron. This 
concept is not contained in our previous set of postulates; an additional 
postulate is needed. Further, there is no reason why the property of spin should 
be confined to the electron. As it turns out, other particles possess an intrinsic 
angular momentum as well. Accordingly, we now add a sixth postulate to the 
previous list of quantum principles. 


6. A particle possesses an intrinsic angular momentum S and an associated magnetic 
moment M s . This spin angular momentum is represented by a hermitian operator S 
which obeys the relation SXS = ihS. Each type of particle has a fixed spin 
quantum number or spin s from the set of values s = 0,1,, 1, §, 2, ... The spin 5 for 
the electron, the proton, or the neutron has a value The spin magnetic moment for 
the electron is given by M s = — eS/ m e . 

As noted in the previous section, spin is a purely quantum-mechanical concept; 
there is no classical-mechanical analog. 

The spin magnetic moment M s of an electron is proportional to the spin 
angular momentum S, 


M s 


gst*B B 

2m e h 


(7.1) 


where g s is the electron spin gyromagnetic ratio and the Bohr magneton /t B is 
defined in equation (5.82). The experimental value of g s is 2.002 319 304 and 
the value predicted by Dirac’s relativistic quantum theory is exactly 2. The 
discrepancy is removed when the theory of quantum electrodynamics is 
applied. We adopt the value g s = 2 here. A comparison of equations (5.81) and 
(7.1) shows that the proportionality constant between magnetic moment and 
angular momentum is twice as large in the case of spin. Thus, the spin 
gyromagnetic ratio for the electron is twice the orbital gyromagnetic ratio. The 
spin gyromagnetic ratios for the proton and the neutron differ from that of the 
electron. 

The hermitian spin operator S associated with the spin angular momentum S 
has components S x , S y , S z , so that 
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S — i>S’, + \ Sy + kS z 
s 2 = S 2 X + S 2 y + Sl 


These components obey the commutation relations 


[5,, Sy] = i hS z , 

[S y , S z ] = i HS X , [S z , S x ] = i hSy 

(7.2) 

or, equivalently 




SXS = i(iS 

(7.3) 


Thus, the quantum-mechanical treatment of generalized angular momentum 
presented in Section 5.2 may be applied to spin angular momentum. The spin 
operator S is identified with the operator J and its components S x , S v , S z with 
J x , J y , J z . Equations (5.26) when applied to spin angular momentum are 

S 2 \sm s ) = s(s + l)h 2 \sm s ), s = 0 , 1, |, 2 , ... (7.4) 

S z \sm s ) = m s h\sm s ), m s = — s — s + 1, ..., s — 1, s (7.5) 

where the quantum numbers j and m are now denoted by s and m s . The 
simultaneous eigenfunctions | sm s ) of the hermitian operators S 2 and S z are 


orthonormal 

(s'm' s \sm s ) =d ss 'd mstn ' s (7.6) 

The raising and lowering operators for spin angular momentum as defined by 
equations (5.18) are 

5+ = S x + iS y (7.7a) 

52 = S x - iS y (7.7b) 

and equations (5.27) take the form 

5' + |5 , m i ) - — m s )(s + m s + 1) h\s, m s + 1) (7.8a) 

= sj(s + m s )(s - m s + 1) h\s, m s - 1) (7.8b) 


In general, the spin quantum numbers s and m s can have integer and half¬ 
integer values. Although the corresponding orbital angular-momentum quan¬ 
tum numbers / and m are restricted to integer values, there is no reason for 
such a restriction on s and m s . 

Every type of particle has a specific unique value of s, which is called the 
spin of that particle. The particle may be elementary, such as an electron, or 
composite but behaving as an elementary particle, such as an atomic nucleus. 
All 4 He nuclei, for example, have spin 0; all electrons, protons, and neutrons 
have spin all photons and deuterons ( 2 H nuclei) have spin 1; etc. Particles 
with spins 0, 1, 2, ... are called bosons and those with spins ... are 
fermions. A many particle system of bosons behaves differently from a many 
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particle system of fermions. This quantum phenomenon is discussed in Chap¬ 
ter 8. 

The state of a particle with zero spin (s = 0) may be represented by a state 
function T ; ( r, t ) of the spatial coordinates r and the time t. However, the state 
of a particle having spin ^ 0) must also depend on some spin variable. We 
select for this spin variable the component of the spin angular momentum 
along the z-axis and use the quantum number m s to designate the state. Thus, 
for a particle in a specific spin state, the state function is denoted by 
^(r, m s , t), where m s has only the (2s + 1) possible values —sh, (—5+ 1 )h, 

..., {s — \)h, sh. While the variables r and t have a continuous range of 
values, the spin variable m s has a finite number of discrete values. 

For a particle that is not in a specific spin state, we denote the spin variable 
by o. A general state function v F(r, o, t ) for a particle with spin s may be 
expanded in terms of the spin eigenfunctions | sm s ), 

S 

W(r, o, t)= Y2 t)\sm s ) (7.9) 

m s =—s 


If ^(r, a, t) is normalized, then we have 


<V|V}= £ 

m s ——s 


I'lfir, m s , t )| 2 dr = 1 


where the orthonormal relations (7.6) have been used. The quantity 
|W(r, m s , t )| 2 is the probability density for finding the particle at r at time t 
with the z-component of its spin equal to m s h. The integral J |W(r, m s , t )| 2 dr 
is the probability that at time t the particle has the value m s h for the z- 
component of its spin angular momentum. 


7.3 Spin one-half 

Since electrons, protons, and neutrons are the fundamental constituents of 
atoms and molecules and all three elementary particles have spin one-half, the 
case s = j is the most important for studying chemical systems. For s = \ there 
are only two eigenfunctions, \\, |) and ~4). For convenience, the state s = 
m s = ( is often called spin up and the ket ||, |) is written as [ j) or as |a). 
Likewise, the state s = m s = is called spin down with the ket \\, — i) 
often expressed as ||) or |/3). Equation (7.6) gives 

(a|a) = = 1, (a\P) = 0 (7.10) 

The most general spin state \%) for a particle with s = \ is a linear com¬ 
bination of |a) and |/3) 
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\x) = C a \o) +cp\p) (7.11) 

where c a and cp are complex constants. If the ket \%) is normalized, then 
equation (7.10) gives 

\Ca\ 2 + |C/j| 2 = 1 

The ket \%) may also be expressed as a column matrix, known as a spinor 



where the eigenfunctions |a) and \f>) in spinor notation are 


l«)=(o), ( 7 - 13 ) 

Equations (7.4), (7.5), and (7.8) for the s = \ case are 

S 2 \a)=\h 2 \a), S 2 \fi) = \h 2 \P) (7.14) 

S z \a) = \h\a), S z \P) = ~\m (7.15) 

S + \a) = 0, S-\P) = 0 (7.16a) 

S+\P) = h\o), S-\a) = h\P) (7.16b) 


Equations (7.16) illustrate the behavior of S+ and .S’_ as ladder operators. The 
operator S + ‘raises’ the state \( J >) to state |a), but cannot raise |a) any further, 
while S _ Towers’ |a) to \(i), but cannot lower \[i). From equations (7.7) and 
(7.16), we obtain the additional relations 

S x \a) = \h\p), S x \p) = \h\a) (7.17a) 

S y \a) = fi\p), S y 1/3) = -\h\a) (7.17b) 

We next introduce three operators o x , o y , o z which satisfy the relations 

S x = \ho x , Sy = \ho y , S z = \ho z (7.18) 

From equations (7.15) and (7.17), we find that the only eigenvalue for each of 
the operators o 2 x , o 2 v , o\ is I. Thus, each squared operator is just the identity 
operator 

a 2 x = a 2 y = al = 1 (7.19) 

According to equations (7.2) and (7.18), the commutation rules for o x , o y , o z 
are 

[o x , Oy] = 2i o z , [o y , o z ] = 2io x , [o z , o x \ = 2^ (7.20) 

The set of operators o x , o y , o z anticommute, a property which we demon¬ 
strate for the pair o x , o y as follows 
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2i(a x o y + o y o x ) = (2ia x )Oy + a y (2ia x ) 

= (OyO z - O z Oy)Oy + O y(0 yO 2 ~ O -O y) 

= -o : o\ + o 2 y o z 
= 0 

where the second of equations (7.20) and equation (7.19) have been used. The 
same procedure may be applied to the pairs o v , o z and o x , o z , giving 

(o x o y + o y o x ) = ( o y o z + o z o y ) = (o -o x + o x 0 z ) = 0 (7.21) 

Combining equations (7.20) and (7.21), we also have 

o x o y = ia 2 , o y o z = i o x , o z o x = i o y (7.22) 


Pauli spin matrices 

An explicit set of operators o x , o y , o - with the foregoing properties can be 
formed using 2X2 matrices. The properties of matrices are discussed in 
Appendix I. In matrix notation, equation (7.19) is 


2 2 2 
= a -- 



(7.23) 


We let o z be represented by the simplest 2X2 matrix with eigenvalues 1 and 
-1 

1 0 


o- = 


0 -1 


(7.24) 


To find o r and o y , we note that 


a b \ / 1 0 

c d)\0 -1 


and 


1 0 
0 -1 


a b 
c d 


a —b 
c —d 


a b 
—c —d 


Since o x and o ,, anticommute with o z as represented in (7.24), we must have 


a —b 
c —d 


—a —b 
c d 


so that a = d = 0 and both o x and o y have the form 

0 b 
c 0 


Further, we have from (7.23) 

0 /A/0 b 
c 0 j { c 0 


2 2 
^ = 


be 0 
0 be 


1 0 
0 1 
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giving the relation be = 1. If we select b = c = 1 for o x , then we have 

0 1 


Or = 


1 0 


The third of equations (7.22) determines that o y must be 

0 -i' 


°y = 


i 0 


In summary, the three matrices are 


Or = 


0 1 
1 0 


°y = 


0 -i 
i 0 


o 7 = 


1 0 
0 -1 


and are known as the Pauli spin matrices. 

The traces of the Pauli spin matrices vanish 

Tr<7 x = Tr o y = Tr o z = 0 
and their determinants equal — 1 


detoy = deter v = deter- = —1 


The unit matrix I 


I 


1 0 
0 1 


(7-25) 


and the three Pauli spin matrices in equation (7.25) form a complete set of 
2X2 matrices. Any arbitrary 2X2 matrix M can always be expressed as the 
linear combination 

M = ci I + c 2 o x + c 3 o y + c 4 cr- 
where c\, c 2 , c 3 , c 4 are complex constants. 


7.4 Spin-orbit interaction 

The spin magnetic moment M s of an electron interacts with its orbital magnetic 
moment to produce an additional term in the Hamiltonian operator and, 
therefore, in the energy. In this section, we derive the mathematical expression 
for this spin-orbit interaction and apply it to the hydrogen atom. 

With respect to a coordinate system with the nucleus as the origin, the 
electron revolves about the fixed nucleus with angular momentum L. However, 
with respect to a coordinate system with the electron as the origin, the nucleus 
revolves around the fixed electron. Since the revolving nucleus has an electric 
charge, it produces at the position of the electron a magnetic field B parallel to 
L. The interaction of the spin magnetic moment M s of the electron with this 
magnetic field B gives rise to the spin-orbit coupling with energy —M s • B. 
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According to the Biot and Savart law of electromagnetic theory, 1 the 
magnetic field B at the ‘fixed’ electron due to the revolving positively charged 
nucleus is given in SI units to first order in v/c by 

B = l(EXv n ) (7.26) 

c L 

where E is the electric field due to the revolving nucleus, v n is the velocity of 
the nucleus relative to the electron, and c is the speed of light. The electric 
force F is related to E and the potential energy V(r ) of interaction between the 
nucleus and the electron by 

F = -eE = -VV 


Thus, the electric field at the electron is 

_ r n d V(r) 


(7.27) 


er dr 

where r n is the vector distance of the nucleus from the electron. The vector r 
from nucleus to electron is — r n and the velocity v of the electron relative to the 
nucleus is —v n . Accordingly, the angular momentum L of the electron is 

L = r X p = m e ( r X v) = m e ( r n X v n ) (7.28) 

Combining equations (7.26), (7.27), and (7.28), we have 

1 d V(r) 

B =-.-L (7.29) 

em c c z r dr 

The spin-orbit energy —M s • B may be related to the spin and orbital 
angular momenta through equations (7.1) and (7.29) 

1 dV(r) 


-M„ B 


m\c 2 r dr 


L S 


This expression is not quite correct, however, because of a relativistic effect in 
changing from the perspective of the electron to the perspective of the nucleus. 
The correction, 2 known as the Thomas precession, introduces the factor \ on 
the right-hand side to give 


—M s • B = 


1 d V(r) 

2m 2 c 2 r dr 


S 


The corresponding spin-orbit Hamiltonian operator H so is, then, 

fv ... 1 dF(r) f ~ 

80 2m 2 c 2 r dr 


(7.30) 


1 R. P. Feyman, R. B. Leighton, and M. Sands (1964) The Feynman Lectures on Physics, Vol. II (Addison- 
Wesley, Reading, MA) section 14-7. 

2 J. D. Jackson (1975) Classical Electrodynamics, 2nd edition (John Wiley & Sons, New York) pp. 541-2. 
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For a hydrogen atom, the potential energy V(r) is given by equation (6.13) 
with Z = 1 and H so becomes 


H so = £(r)L • S 


(7.31) 


where 


£ 0 ) = 


( 7 . 32 ) 


SjT£ 0 mlc 2 r 3 

Thus, the total Flamiltonian operator // for a hydrogen atom including spin- 
orbit coupling is 

H = H 0 + H so = H 0 + £(r) L • S (7.33) 

where Ho is the Hamiltonian operator for the hydrogen atom without the 
inclusion of spin, as given in equation (6.14). 

The effect of the spin-orbit interaction term on the total energy is easily 
shown to be small. The angular momenta |L| and |S| are each on the order of h 
and the distance r is of the order of the radius ao of the first Bohr orbit. If we 
also neglect the small difference between the electronic mass m c and the 
reduced mass p, the spin-orbit energy is of the order of 

e 2 ft 2 2 

%7t£om 2 c 2 a\ 1 

where | E\ | is the ground-state energy for the hydrogen atom with Hamiltonian 
operator Ho as given by equation (6.57) and a is the fine structure constant, 
defined by 

e 2 h 1 

a = 


AjTEohc m e cao 137.036 
Thus, the spin-orbit interaction energy is about 5 X 10“ 5 times smaller than 

|£i|. 

While the Hamiltonian operator Ho for the hydrogen atom in the absence of 
the spin-orbit coupling term commutes with L and with S, the total Hamilto¬ 
nian operator H in equation (7.33) does not commute with either L or S 
because of the presence of the scalar product L • S. To illustrate this feature, 
we consider the commutators [L z , L • S] and [52, L • S], 

[L z , L • S] = \L Z , ( L X S X + LySy + L-S z )\ = [L z , L X \S X + [L z , L y ]S y + 0 


= i h(L y S x - L x S y ) fi 0 


(7.34) 


[52. L • S] = [57, S X \L X + [52, S y ]L y = i h(L x S y - L y S x ) fi 0 (7.35) 

where equations (5.10) and (7.2) have been used. Similar expressions apply to 
the other components of L and S. Thus, the vectors L and S are no longer 
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constants of motion. However, the operators L 2 and S 2 do commute with L • S, 
which follows from equations (5.15), so that the quantities L 2 and S 2 are still 
constants of motion. 

We now introduce the total angular momentum J, which is the sum of L 
and S 


J = L + S (7.36) 

The operators J and J 2 commute with H 0 . The addition of equations (7.34) 
and (7.35) gives 

[J z , L • S] = \L Z , L • S] + [S 2 , L ■ S] = 0 

The addition of similar relations for the x- and ^-components of these angular 
momentum vectors leads to the result that [J, L • S] = 0, so that J and L • S 
commute. Furthermore, we may easily show that J 2 commutes with L • S 
because each term in J 2 = L 2 + S 2 + 2L • S commutes with L • S. Thus, J 
and J 2 commute with II in equation (7.33) and J and J 2 are constants of 
motion. 

That the quantities L 2 , S 2 , J 2 , and J are constants of motion, but L and S are 
not, is illustrated in Figure 7.1. The spin magnetic moment M s , which is 
antiparallel to S, exerts a torque on the orbital magnetic moment M, which is 
antiparallel to L, and alters its direction, but not its magnitude. Thus, the orbital 
angular momentum vector L precesses about J and L is not a constant of 
motion. However, since the magnitude of L does not change, the quantity L 2 is 
a constant of motion. Likewise, the orbital magnetic moment M exerts a torque 
on M s , causing S to precess about J. The vector S is, then, not a constant of 



Figure 7.1 Precession of the orbital angular momentum vector L and the spin angular 
momentum vector S about their vector sum J. 
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motion, but S 2 is. Since J is fixed in direction and magnitude, both J and J 2 
are constants of motion. 

If we form the cross product j X j and substitute equations (7.36), (5.11), 
and (7.3), we obtain 

J X J = (L + S) X (L + S) = (L X L) + (S X S) = iht + ihS = i/hJ 

where the cross terms (L X S) and (S X L) cancel each other. Thus, the 
operator J obeys equation (5.12) and the quantum-mechanical treatment of 
Section 5.2 applies to the total angular momentum. Since J x , J y , and J z each 
commute with J 2 but do not commute with one another, we select J- and seek 
the simultaneous eigenfunctions | nlsjntj) of the set of mutually commuting 
operators H, L 2 , S 2 , J 2 , and J z 


H\nlsjmj) = E n \nlsjrrij) 

(7.37a) 

L 2 \nlsjnij ) = 1(1+ 1 )fi 2 nlsjmj) 

(7.37b) 

S 2 \nlsjnij ) = s(s + \)fi 2 \ nlsjntj) 

(7.37c) 

J 2 \nlsjntj) = j(j + \)ti 2 \nlsjnij) 

(7.37d) 

J : \nlsjmj) = mjh\nlsjnij ), ny = —j, — j + 1, .. 

•,7-1,7 (7.37e) 

From the expression 



J : \nlsjm.j) = ( L z = S z )\nlsjnij) = (m + m s )fi\nlsjmj) 


obtained from (7.36), (5.28b), and (7.5), we see that 

nij = m + m s (7.38) 

The quantum number j takes on the values 

l + s, l + s — 1, l -\- s — 2, ..., \l — s| 

The argument leading to this conclusion is somewhat complicated and may be 
found elsewhere. 3 In the application being considered here, the spin 5 equals | 
and the quantum number j can have only two values 

j=l±\ (7.39) 

The resulting vectors J are shown in Figure 7.2. 

The scalar product L • S in equation (7.33) may be expressed in terms of 
operators that commute with H by 

L • S = i(L + S) • (L + S) — ±L ■ L — iS • S = \{J 2 - L 2 - S 2 ) (7.40) 


3 


B. H. Brandsen and C. J. Joachain (1989) Introduction to Quantum Mechanics (Addison Wesley Longman, 
Harlow, Essex), pp. 299, 301; R. N. Zare (1988) Angular Momentum (John Wiley & Sons, New York), pp. 
45-8. 
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L 


L 


i - 1 + 2 j ~ 1 ~ 1 

Figure 7.2 The total angular momentum vectors J obtained from the sum of L and S 
for s — \ and s — — 




so that H becomes 

H=H 0 + £(r)(J 2 -L 2 - S 2 ) (7.41) 

Equation (7.37a) then takes the form 

{H 0 + \fi 2 ^{r){J{j + 1) - /(/ + 1) - s(s + 1)]}| nlsjrnj) = E n \nlsjrrij) (7.42) 
or 

n, l,\, 1 + mj) = E„\n, l,\, 1 + mj) if j = / + \ 

(7.43a) 

n, l, / - i, mj) = E„\n, l, I - i mj) 

if j = l - \ (7.43b) 

where equations (7.37b), (7.37c), (7.37d), and (7.39) have also been intro¬ 
duced. 

Since the spin-orbit interaction energy is small, the solution of equations 
(7.43) to obtain E n is most easily accomplished by means of perturbation 
theory, a technique which is presented in Chapter 9. The evaluation of E n is 
left as a problem at the end of Chapter 9. 


£7 (/ + \ )h 2 

Ho -2- 


- Ill c , N 

Ho + ^-%(r) 


Problems 

7.1 Determine the angle between the spin vector S and the z-axis for an electron in 
spin state |a). 

7.2 Prove equation (7.19) from equations (7.15) and (7.17). 
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7.3 Show that the pair of operators o v , o z anticommute. 

7.4 Using the Pauli spin matrices in equation (7.25) and the spinors in (7.13), 

(a) construct the operators o+ and cr_ corresponding to S. and .S' 

(b) operate on |a) and on |/3) with a 2 , o z , a + , cr_, o x , and o y and compare the 
results with equations (7.14), (7.15), (7.16), and (7.17). 

7.5 Using the Pauli spin matrices in equation (7.25), verify the relationships in (7.19) 
and (7.22). 
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Systems of identical particles 


The postulates 1 to 6 of quantum mechanics as stated in Sections 3.7 and 7.2 
apply to multi-particle systems provided that each of the particles is distin¬ 
guishable from the others. For example, the nucleus and the electron in a 
hydrogen-like atom are readily distinguishable by their differing masses and 
charges. When a system contains two or more identical particles, however, 
postulates 1 to 6 are not sufficient to predict the properties of the system. These 
postulates must be augmented by an additional postulate. This chapter intro¬ 
duces this new postulate and discusses its consequences. 


8.1 Permutations of identical particles 

Particles are identical if they cannot be distinguished one from another by any 
intrinsic property, such as mass, charge, or spin. There does not exist, in fact 
and in principle, any experimental procedure which can identify any one of the 
particles. In classical mechanics, even though all particles in the system may 
have the same intrinsic properties, each may be identified, at least in principle, 
by its precise trajectory as governed by Newton’s laws of motion. This 
identification is not possible in quantum theory because each particle does not 
possess a trajectory; instead, the wave function gives the probability density for 
finding the particle at each point in space. When a particle is found to be in 
some small region, there is no way of determining either theoretically or 
experimentally which particle it is. Thus, all electrons are identical and there¬ 
fore indistinguishable, as are all protons, all neutrons, all hydrogen atoms with 
1 H nuclei, all hydrogen atoms with 2 H nuclei, all helium atoms with 4 He 
nuclei, all helium atoms with 3 He nuclei, etc. 

Two-particle systems 

For simplicity, we first consider a system composed of two identical particles 
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of mass m . If we label one of the particles as particle 1 and the other as particle 
2, then the Hamiltonian operator H( \, 2) for the system is 

~2 ~2 

//(l,2) = |I + |^+F(q 1 ,q 2 ) (8.1) 

2 m 2 m 

where q, (i = 1,2) represents the three-dimensional (continuous) spatial 
coordinates r, and the (discrete) spin coordinate oq of particle i. In order for 
these two identical particles to be indistinguishable from each other, the 
Hamiltonian operator must be symmetric with respect to particle interchange, 
i.e., if the coordinates (both spatial and spin) of the particles are interchanged, 
//(1, 2) must remain invariant 

H( 1, 2) = H( 2, 1) 

If H( \, 2) and H( 2, 1) were to differ, then the corresponding Schrodinger 
equations and their solutions would also differ and this difference could be 
used to distinguish between the two particles. 

The time-independent Schrodinger equation for the two-particle system is 

H(l, 2)W v (l, 2) = E V W V (\, 2) (8.2) 

where v delineates the various states. The notation ’Ifif 1, 2) indicates that the 
first particle has coordinates qi and the second particle has coordinates q 2 . If 
we exchange the two particles so that particles 1 and 2 now have coordinates 
q 2 and qi, respectively, then the Schrodinger equation (8.2) becomes 

Hi2, 1)^(2, 1) = Hi 1, 2)W V (2, 1) = E V W V (2, 1) (8.3) 

where we have noted that H( 1, 2) is symmetric. Equation (8.3) shows that 
x V v (2, 1) is also an eigenfunction of 11(1, 2) belonging to the same eigenvalue 
E v . Thus, any linear combination of TVl, 2) and X E,.(2, 1) is also an eigen¬ 
function of H( 1, 2) with eigenvalue E v . For simplicity of notation in the 
following presentation, we omit the index v when it is clear that we are 
referring to a single quantum state. 

The eigenfunction 1, 2) has the form of a wave in six-dimensional space. 
The quantity ^*(1, 2) X V( \, 2) dri dr 2 is the probability that particle 1 with 
spin function X\ is in the volume element diq centered at iq and simultaneously 
particle 2 with spin function Xi is in the volume element dr? at r 2 . The product 
*P*(1, 2)^(1, 2) is, then, the probability density. The eigenfunction X ¥(2, 1) 
also has the form of a six-dimensional wave. The quantity X V'(2, 1 ) y V(2, 1) is 
the probability density for particle 2 being at ri with spin function Xi and 
simultaneously particle 1 being at r 2 with spin function y 2 . In general, the two 
eigenfunctions *P(1, 2) and V F(2, 1) are not identical. As an example, if 
Wl, 2) is 
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0( 1, 2) = Q~ an Q~ bri (br 2 - 1) 
where r x = |r 1 1 and r 2 = |r 2 |, then 0(2, 1) would be 

0(2, 1) = e~ ar2 e~ hr '(bri - l)f 0(1, 2) 

Thus, the probability density of the pair of particles depends on how we label 
the two particles. Since the two particles are indistinguishable, we conclude 
that neither 0(1, 2) nor 0(2, 1) are desirable wave functions. We seek a wave 
function that does not make a distinction between the two particles and, 
therefore, does not designate which particle is at ri and which is at r 2 . 

To that end, we now introduce the linear hermitian exchange operator P, 
which has the property 

>/(l,2) = /(2, 1) (8.4) 

where /(1, 2) is an arbitrary function of qi and q 2 . If P operates on 
H( 1, 2)0(1, 2), we have 

P[H( 1, 2)W(1, 2)] = H( 2, l)f(2, 1) = H( 1, 2)0(2, 1) = H{ 1, 2)>0(1, 2) 

(8.5) 

where we have used the fact that H( 1, 2) is symmetric. From equation (8.5) we 
see that P and H( 1, 2) commute 

[P,H( 1,2)] = 0, (8.6) 

Consequently, the operators P and //(l, 2) have simultaneous eigenfunctions. 

If 0(1, 2) is an eigenfunction of >, the corresponding eigenvalue X is given 
by 

>0(1,2) = A0(1,2) (8.7) 

We then have 

> 2 0(1, 2) = >[>0(1, 2)] = >[AO(l, 2)] = A>0(1, 2) = A 2 0(1, 2) (8.8) 

Moreover, operating on 0(1, 2) twice in succession by > returns the two 
particles to their original order, so that 

> 2 0(1, 2) = >0(2, 1) = 0(1, 2) (8.9) 

From equations (8.8) and (8.9), we see that > 2 = 1 and that A 2 = 1. Since P is 
hermitian, the eigenvalue A is real and we obtain A = ± 1. 

There are only two functions which are simultaneous eigenfunctions of 
//(l, 2) and P with respective eigenvalues E and ±1. These functions are the 
combinations 

0s = 2-'/ 2 [0(1,2) + 0(2, i)] 

0^ = 2- 1 / 2 [0(l,2)-0(2, 1)] 
which satisfy the relations 


(8.10a) 

(8.10b) 
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p\V S = W S (8.11a) 

PVa = -W a (8.11b) 

The factor 2" 1 / 2 in equations (8.10) normalizes Tfy and x ¥ t if T ; ( 1, 2) is 
normalized. The combination 'by is symmetric with respect to particle 
interchange because it remains unchanged when the two particles are ex¬ 
changed. The function 'tf i, on the other hand, is antisymmetric with respect to 
particle interchange because it changes sign, but is otherwise unchanged, when 
the particles are exchanged. 

The functions 'if i and 'By are orthogonal. To demonstrate this property, we 
note that the integral over all space of a function of two or more variables must 
be independent of the labeling of those variables 


fix i, ..., x N ) cLvi ... dx v = 


f(yu • • •, Tv) dvi ... dyv (8.12) 


In particular, we have 

I I/O > 2)dqi dq 2 


/(2, 1) dqi dq 2 


or 

mi, 2)\w(2, d) = m 2 , imo, 2)> (s.i3) 

where /(1, 2) = ^*(1, 2) X ¥(2, 1). Application of equation (8.13) to 
gives 

mim) = cs.i4) 

Applying equations (8.11) to the right-hand side of (8.14), we obtain 

mm> = -mim) 

Thus, the scalar product {TVyj'Ifi) must vanish, showing that X ¥ A and 'Ty are 
orthogonal. 

If the wave function for the system is initially symmetric (antisymmetric), 
then it remains symmetric (antisymmetric) as time progresses. This property 
follows from the time-dependent Schrodinger equation 

ih dW g’ 2) = Hi 1, 2)f(l, 2) (8.15) 

Since H( 1, 2) is symmetric, the time derivative 0 X ¥/dt has the same symmetry 
as W. During a small time interval At, therefore, the symmetry of W does not 
change. By repetition of this argument, the symmetry remains the same over a 
succession of small time intervals, and by extension over all time. 

Since 'By does not change and only the sign of'T,, changes if particles 1 and 
2 are interchanged, the respective probability densities 't^’Ty and 'T^'lfy are 
independent of how the particles are labeled. Neither specifies which particle 
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has coordinates qi and which q 2 . Thus, only the linear combinations 'Tv and 
'Ti are suitable wave functions for the two-identical-particle system. We note 
in passing that the two probability densities are not equal, even though 'Tv and 
correspond to the same energy value E. We conclude that in order to 
incorporate into quantum theory the indistinguishability of the two identical 
particles, we must restrict the allowable wave functions to those that are 
symmetric and antisymmetric, i.e., to those that are simultaneous eigenfunc¬ 
tions of H( 1, 2) and P. 


Three-particle systems 

The treatment of a three-particle system introduces a new feature not present in 
a two-particle system. Whereas there are only two possible permutations and 
therefore only one exchange or permutation operator for two particles, the 
three-particle system requires several permutation operators. 

We first label the particle with coordinates qi as particle 1, the one with 
coordinates q 2 as particle 2, and the one with coordinates q 3 as particle 3. The 
Hamiltonian operator H (\, 2, 3) is dependent on the positions, momentum 
operators, and perhaps spin coordinates of each of the three particles. For 
identical particles, this operator must be symmetric with respect to particle 
interchange 

H{ 1, 2, 3) = H{ 1, 3, 2) = H(2, 3, 1) = H{ 2, 1, 3) = H( 3, 1, 2) = H( 3, 2, 1) 
If W( 1, 2, 3) is a solution of the time-independent Schrodinger equation 

H( 1, 2, 3)*T(1, 2, 3) = fHT(l, 2, 3) (8.16) 

then 'TO, 3, 2), T(2, 3, 1), etc., and any linear combinations of these wave 
functions are also solutions with the same eigenvalue E. The notation 
W( i, j, k ) indicates that particle i has coordinates qi, particle j has coordinates 
q 2 , and particle k has coordinates q 3 . As in the two-particle case, we seek 
eigenfunctions of H{ 1, 2, 3) that do not specify which particle has coordinates 
q t, i= 1,2, 3. 

We define the six permutation operators P a _/> iy for a f (i f y = 1, 2, 3 by the 
relations 


Pi 23^(6 j, k) = 'T(/, j, k )' 
>132^(6 j, k) = 'T(/, k, j ) 
T 23 ,W(i, j, k) = 'T(/, k, 0 
PuflHk j , k) = '!'(/, i, k ) 
j, k) = T(/q i, j ) 
>32^(6 j, k) = 'T(/c, j, i) 


1,2,3 


(8.17) 


The operator 7),/iy replaces the particle with coordinates qi (the first position) 
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by the particle with coordinates q a , the particle with coordinates q 2 (the second 
position) by that with q^, and the particle with coordinates q 3 (the third 
position) by that with q y . For example, we have 


Wl,2, 3) = W(2, 1,3) 

(8.18a) 

p 213 ^(2, 1, 3) = m 2 , 3) 

(8.18b) 

>2!3^(3, 2 , 1) = 3, 1) 

(8.18c) 

>23! W, 2, 3) = ; F(2, 3, 1) 

(8.18d) 

>231^(2, 3, 1) = m 1, 2) 

(8.18e) 


The permutation operator P l2 3 is an identity operator because it leaves the 
function x V{i, j, k ) unchanged. From (8.18a) and (8.18b), we obtain 

>2 13 W,2, 3) = W(l, 2, 3) 

so that P\ n equals unity. The same relationship can be demonstrated to apply 
to the operators P 132 and P 321 , as well as to the identity operator P 123 , giving 

^213 = ^132 = ^321 = ^123 = >123 = 1 (8-19) 

Any permutation corresponding to one of the operators P a p y other than P 123 
is equivalent to one or two pairwise exchanges. Accordingly, we introduce the 
linear hermitian exchange operators P\ 2 , P 22 , and P 31 with the properties 

P\2 X V(U j, k) = W(j, i, k) ) 

P 2 mi, j, k) = W(i, k, j) ) i + j+ k = 1, 2, 3 (8.20) 

/vm j, k) = w(k, j, i ) J 

The exchange operator P a j> interchanges the particles with coordinates q a and 
q^. It is obvious that the order of the subscripts in P a p is immaterial, so that 
P a p = P/j a . The permutations from P 2 13 , Pi 32 , and P 22i are the same as those 
from P\ 2 , P 23 , and P 2 \, respectively, giving 

Pm = P\2, P 132 = P23, F321 = P31 

The permutation from F 231 may also be obtained by first applying the exchange 
operator P\ 2 and then the operator P 22 . Alternatively, the same result may be 
obtained by first applying P 2 3 followed by P 2 1 or by first applying P 2 \ followed 
by P\ 2 - This observation leads to the identities 

Pl 3 \ = P23P12 = P31P23 = P\ 2 P 2 \ ( 8 . 21 ) 

A similar argument yields 

P3 12 = P3\P\2 = P23P3\ = P\2p23 ( 8 . 22 ) 

These permutations of the three particles are expressed in terms of the 
minimum number of pairwise exchange operators. Less efficient routes can 
also be visualized. For example, the permutation operators Pi 32 and P 2 3 \ may 
also be expressed as 
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>1 32 = >3l>23>12 = >12>23>31 


P 231 — >12>23>31>12 — >31>12>31>12 


However, the number of pairwise exchanges for a given permutation is always 
either odd or even, so that Pm, > 231 , >312 are even permutations and Pm, 
P 213 , >321 are odd permutations. 

Applying the same arguments regarding the exchange operator > for the 
two-particle system, we find that 



P 2 — 1 

■*31 1 


giving real eigenvalues ± 1 for each operator. We also find that each exchange 
operator commutes with the Hamiltonian operator H 


[>12, H ] = [>23, H ] = [>31, H] = 0 (8.23) 

so that >12 and H possess simultaneous eigenfunctions, >23 and H possess 
simultaneous eigenfunctions, and >31 and H possess simultaneous eigenfunc¬ 
tions. However, the operators >12, >23, >31 do not commute with each other. 
For example, if we operate on the wave function hh(l, 2, 3) first with the 
product >31 >12 and then with the product >i2>3i, we obtain 


> 31 > 12 ^(1, 2, 3) = > 3 ^( 2 , 1, 3) = m 1, 2) 

> 12 > 3 i^(l, 2, 3) = > 12 m 2, 1) = rp(2, 3, 1) 

The wave function rp(3, 1, 2) is not the same as ^(2,3,1), leading to the 
conclusion that 


>31 >12 > >12 >31 

Thus, a set of simultaneous eigenfunctions of H( 1, 2, 3) and >12 and a set of 
simultaneous eigenfunctions of H( 1, 2, 3) and > 31 are not, in general, the same 
set. Likewise, neither set are simultaneous eigenfunctions of H( 1, 2, 3) and 

> 23 - 

There are, however, two eigenfunctions of H( 1, 2, 3) which are also simul¬ 
taneous eigenfunctions of all three pair exchange operators > 12 , > 23 , and > 31 . 
These eigenfunctions are 'F s and V 1L/, which have the property 

Pa^s = ^s, a. fP = 1,2 (8.24a) 


PafPVA = -^a, af/3= 1,2 (8.24b) 

To demonstrate this feature, we assume that ^(1, 2, 3) is a simultaneous 
eigenfunction not only of H( 1, 2, 3), but also of > 12 , > 23 , and > 31 . Therefore, 
we have 


H 12 W(1,2, 3) = 2^(1, 2, 3) 
> 23 ^ 1,2, 3) = i 2 f(l, 2, 3) 


(8.25) 
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>3iW,2, 3) = A 3 W,2, 3) 
where X\ = ±1, A 2 = ±1, A 3 = ±1 are the respective eigenvalues. From equa¬ 
tions (8.21) and (8.25), we obtain 

>231^(1, 2, 3) = >23>12^(1, 2, 3) = >31>23^(1, 2, 3) = PnPimi 2 , 3) 

= W(2, 3, 1) 
or 

A 2 Ai*F(1, 2, 3) = A 3 A 2 'F(1, 2, 3) = AiA 3 W(1, 2, 3) 
from which it follows that 

Ai = A 2 = A 3 

Thus, the simultaneous eigenfunctions ^(1, 2, 3) are either symmetric 
(Aj = 1 2 = A 3 = 1) or antisymmetric (Ai = A 2 = A 3 = — 1). 

The symmetric 'Fs or antisymmetric 'F,, eigenfunctions may be constructed 
from *F(1, 2, 3) by the relations 

W s = 6“ 1/2 [W(1, 2, 3) + ^(1, 3, 2) + *F(2, 3, 1) + *F(2, 1, 3) + *P(3, 1, 2) 

+ ^(3,2,1)] (8.26a) 

W A = 6- x / 2 [^(1, 2, 3) - W(l, 3, 2) + *P(2, 3, 1) - W(2, 1, 3) + *P(3, 1, 2) 

-^3,2, 1)] (8.26b) 

where the factor 6 -1 / 2 normalizes 'Fs and W a if 'F( 1, 2, 3) is normalized. As 
in the two-particle case, the functions 'Fy and 'F ( are orthogonal. Moreover, a 
wave function which is initially symmetric (antisymmetric) remains symmetric 
(antisymmetric) over time. The probability densities 'F * 'Fy and 'F*'F i are 
independent of how the three particles are labeled. The two functions 'Fy and 
W A are, therefore, the eigenfunctions of H(l, 2, 3) that we are seeking. 

Equations (8.26) may be expressed in another, equivalent way. If we let P be 
any one of the permutation operators P„p y in equation (8.17), then we may 
write 

WS.A = 6" 1/2 ^ dp/"P(l, 2, 3) (8.27) 

p 

where the summation is taken over the six different operators P a p y , and dp is 
either +1 or — 1. For the symmetric wave function 'Fs, d P is always +1, but for 
the antisymmetric wave function 'F /( , dp is +1 (—1) if the permutation 
operator P involves the exchange of an even (odd) number of pairs of particles. 
Thus, dp is —1 for P n2 , Pin and P 22 \. 

N-particle systems 

The treatment of a three-particle system may be generalized to an /V-particlc 
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system. We begin by labeling the N particles, with each particle i having 
coordinates q ; . For identical particles, the Hamiltonian operator must be 
symmetric with respect to particle permutations 

m, 2, N) = H( 2, 1, ..., N) = H(N, 2.1) = - - ■ 

There are Ad possible permutations of the N particles. If W(l, 2, ..., N) is a 
solution of the time-independent Schrodinger equation 

H( 1, 2, , AWT, 2, ..., N) = £f(l, 2, ..., N) (8.28) 

then ^(2, 1, ..., N ), WfA, 2, ..., 1), etc., and any linear combination of 
these wave functions are also solutions with eigenvalue E. 

We next introduce the set of linear hermitian exchange operators P a p 
(a f f3 = 1,2,..., N). The exchange operator P a p interchanges the pair of 
particles in positions a (with coordinates q a ) and (3 (with coordinates q/;) 

PafiVU, = W(i, ...,k, (8.29) 

a P “ p 

As in the three-particle case, the order of the subscripts on P a p is immaterial. 
Since there are N choices for the first particle and (N — 1) choices for the 
second particle (a f [>) and since each pair is to be counted only once 
(P a p = P g a ), there arc N( N — l)/2 members of the set P a p. 

Applying the same arguments regarding the exchange operator P for the 
two-particle system, we find that P 2 a/5 = 1, giving real eigenvalues ±1. We also 
find that P a p and H commute 

[Pafi,H] = 0, 1,2, ..., N (8.30) 

so that they possess simultaneous eigenfunctions. However, the members of the 
set P„fj do not commute with each other. There are only two functions, 'Fs and 
W A , which are simultaneous eigenfunctions of // and all of the pairwise 
exchange operators P a p. These two functions have the property 

Pafi'Vs = x V.s, 1,2, ..., N (8.31a) 

P af} W A = -W A , af/3= 1,2, ..., N (8.31b) 

and may be constructed from ^(1,2, ..., A) by the relation 

W = (^!) _1/2 ^ <5pW1, 2, ..., N) (8.32) 

p 

In equation (8.32) the operator P is any one of the N\ operators, including the 
identity operator, that permute a given order of particles to another order. The 
summation is taken over all N\ permutation operators. The quantity dp is 
always +1 for the symmetric wave function l I ; v, but for the antisymmetric 
wave function V F ,|, d P is +1 (—1) if the permutation operator P involves the 
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exchange of an even (odd) number of particle pairs. The factor (AH) -1 / 2 
normalizes W 5 and if *P(1, 2, ..., N) is normalized. 

Using the same arguments as before, we can show that 'I 1 *, and X V A in 
equation (8.32) are orthogonal and that, over time, 'Ey remains symmetric and 
v T ,i remains antisymmetric. Since the probability densities 'T^’l's and v F* l I ; 
are independent of how the N particles are labeled, the two functions v Fs and 
v T ,i are the only suitable eigenfunctions of H{\, 2, ..., N) to represent a 
system of N indistinguishable particles. 


8.2 Bosons and fermions 

In quantum theory, identical particles must be indistinguishable in order for the 
theory to predict results that agree with experimental observations. Conse¬ 
quently, as shown in Section 8.1, the wave functions for a multi-particle system 
must be symmetric or antisymmetric with respect to the interchange of any pair 
of particles. If the wave functions are not either symmetric or antisymmetric, 
then the probability densities for the distribution of the particles over space are 
dependent on how the particles are labeled, a property that is inconsistent with 
indistinguishability. It turns out that these wave functions must be further 
restricted to be either symmetric or antisymmetric, but not both, depending on 
the identity of the particles. 

In order to accommodate this feature into quantum mechanics, we must add 
a seventh postulate to the six postulates stated in Sections 3.7 and 7.2. 

7. The wave function for a system of N identical particles is either symmetric or 
antisymmetric with respect to the interchange of any pair of the N particles. 
Elementary or composite particles with integral spins (.v s? 0. 1,2, ...) possess 
symmetric wave functions, while those with half-integral spins (5 = |, ...) 

possess antisymmetric wave functions. 

The relationship between spin and the symmetry character of the wave function 
can be established in relativistic quantum theory. In non-relativistic quantum 
mechanics, however, this relationship must be regarded as a postulate. 

As pointed out in Section 7.2, electrons, protons, and neutrons have spin 
Therefore, a system of N electrons, or N protons, or N neutrons possesses an 
antisymmetric wave function. A symmetric wave function is not allowed. 
Nuclei of 4 He and atoms of 4 He have spin 0, while photons and 2 H nuclei have 
spin 1. Accordingly, these particles possess symmetric wave functions, never 
antisymmetric wave functions. If a system is composed of several kinds of 
particles, then its wave function must be separately symmetric or antisym¬ 
metric with respect to each type of particle. For example, the wave function for 
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the hydrogen molecule must be antisymmetric with respect to the interchange 
of the two nuclei (protons) and also antisymmetric with respect to the 
interchange of the two electrons. As another example, the wave function for the 
oxygen molecule with 16 0 nuclei (each with spin 0) must be symmetric with 
respect to the interchange of the two nuclei and antisymmetric with respect to 
the interchange of any pair of the eight electrons. 

The behavior of a multi-particle system with a symmetric wave function 
differs markedly from the behavior of a system with an antisymmetric wave 
function. Particles with integral spin and therefore symmetric wave functions 
satisfy Bose-Einstein statistics and are called bosons, while particles with 
antisymmetric wave functions satisfy Fermi-Dirac statistics and are called 
fermions. Systems of 4 He atoms (helium-4) and of 3 He atoms (helium-3) 
provide an excellent illustration. The 4 He atom is a boson with spin 0 because 
the spins of the two protons and the two neutrons in the nucleus and of the two 
electrons are paired. The 3 He atom is a fermion with spin \ because the single 
neutron in the nucleus is unpaired. Because these two atoms obey different 
statistics, the thermodynamic and other macroscopic properties of liquid 
helium-4 and liquid helium-3 are dramatically different. 


8.3 Completeness relation 

The completeness relation for a multi-dimensional wave function is given by 
equation (3.32). However, this expression does not apply to the wave functions 
TVs '..4 for a system of identical particles because TVs,u are either symmetric or 
antisymmetric, whereas the right-hand side of equation (3.32) is neither. 
Accordingly, we derive here 1 the appropriate expression for the completeness 
relation or, as it is often called, the closure property for TVs.u- 

For compactness of notation, we introduce the 4/V-dimensional vector Q 
with components q, for i = 1, 2, ..., N. The permutation operators P are 
allowed to operate on Q directly rather than on the wave functions. Thus, the 
expression PT^l, 2, ..., N) is identical to T^PQ). In this notation, equation 
(8.32) takes the form 

Vvs.a = IN!)" 1 / 2 ^ c5pTV(PQ) (8.33) 

p 

We begin by considering an arbitrary function /(Q) of the 4/V-dimensional 
vector Q. Following equation (8.33), we can construct from /(Q) a function 
F( Q) which is either symmetric or antisymmetric by the relation 

1 We follow the derivation of D. D. Fitts (1968) Nuovo Cimento 5SB, 557. 
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F(Q) = (N\)~ 1/2 ]T d P f(PQ) (8.34) 

P 

Since F( Q) is symmetric (antisymmetric), it may be expanded in terms of a 
complete set of symmetric (antisymmetric) wave functions l l ; ,,(Q) (we omit the 
subscript S, A) 


F{Q) = Y j ^v(Q) 

V 


The coefficients c v are given by 


c v 



(Q')F(Q') dQ' 


(8.35) 


(8.36) 


because the wave functions *P„(Q) are orthonormal. We use the integral 
notation to include summation over the spin coordinates as well as integration 
over the spatial coordinates. Substitution of equation (8.36) into (8.35) yields 


F( Q) 


Uq') 


^2W*(Q')W V (Q) 


dQ' 


(8.37) 


where the order of summation and the integration over Q' have been inter¬ 
changed. We next substitute equation (8.34) for F(Q') into (8.37) to obtain 


TXQ) = (Anr 1/2 ^ 


f(PQ') 


5> V *(QWQ) 


dQ' 


(8.38) 


We now introduce the reciprocal or inverse operator P 1 to the permutation 
operator P (see Section 3.1) such that 

P~ l P = PP~ X = 1 


We observe that 

^(> _1 Q) = d P ^W v (Q) = 6 P W V (Q) (8.39) 

The quantity <) P -i equals d P because both P~ l and P involve the interchange 
of the same number of particle pairs. We also note that 


^(5 2 p = W (8.40) 

p 


because there are N\ terms in the summation and each term equals unity. 

We next operate on each term on the right-hand side of equation (8.38) by 
P~ l . Since P in equation (8.38) operates only on the variable Q' and since the 
order of integration over Q' is immaterial, we obtain 


F(Q) = (N\r l/2 J2 6 ^ 


Z(Q') 


^W*(Z-'QWQ) 


dQ' (8.41) 
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Application of equations (8.39) and (8.40) to (8.41) gives 


F(Q) = (N\) l/2 


f(Q') 


J2 x v*AQ'Wv(.Q) 


dQ' 


(8.42) 


Since /(Q') is a completely arbitrary function of Q', we may compare 
equations (8.34) and (8.42) and obtain 


]T W*(Q')^(Q) = (A!)- 1 dpd(PQ - Q) (8.43) 

V P 


where d(Q — Q') is the Dirac delta function 

N 

<5(Q - Q ) = n - r ^; (8-44) 

i= 1 

Equation (8.43) is the completeness relation for a complete set of symmetric 
(antisymmetric) multi-particle wave functions. 


8.4 Non-interacting particles 

In this section we consider a many-particle system in which the particles act 
independently of each other. For such a system of N identical particles, the 
Hamiltonian operator H(\, 2, ..., N) may be written as the sum of one- 
particle Hamiltonian operators H(i) for i = 1,2,..., N 

H{ 1, 2, N) = H( 1) + H( 2) + ■ • ■ + H(N) (8.45) 

In this case, the operator H(l, 2, ..., N) is obviously symmetric with respect 
to particle interchanges. For the N particles to be identical, the operators H(i) 
must all have the same form, the same set of orthonormal eigenfunctions tp n (i), 
and the same set of eigenvalues E n , where 

H(i)ip n (i) = E n ip n (i ); i = 1, 2, ..., N (8.46) 

As a consequence of equation (8.45), the eigenfunctions *P V (1, 2, ..., N) of 
H{\, 2, ..., N) are products of the one-particle eigenfunctions 

^„(1, 2, ..., N) = tp a ( 1)^(2) • • • fp(N) (8.47) 

and the eigenvalues E v of //(1, 2, ..., N) are sums of one-particle energies 

E v = E a + Eh + ■ ■ ■ + E p (8.48) 

In equations (8.47) and (8.48), the index v represents the set of one-particle 
states a, b, p and indicates the state of the A-particle system. 

The /V-particlc eigenfunctions x V v ( 1, 2, .... /V) in equation (8.47) are not 
properly symmetrized. For bosons, the wave function ^,,(1, 2, ..., N) must be 
symmetric with respect to particle interchange and for fermions it must be 
antisymmetric. Properly symmetrized wave functions may be readily con- 
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structed by applying equation (8.32). For example, for a system of two identical 
particles, one particle in state ip a , the other in state ?/>/,, the symmetrized two- 
particle wave functions are 

'fVsll, 2) = 2~ l l 2 [xp a (\)xp b (2) + xp a {2)xp b {\)] (8.49a) 

VabA 1 . 2) = 2- l ' 2 \iPa(l)ip b (2) - ip a (2)ip b (\)] (8.49b) 

The expression (8.49a) for two bosons is not quite right, however, if states ip a 
and xp b are the same state (a = b ), for then the normalization constant is \ 
rather than 2~ li/2 , so that 

^( 1 , 2 ) = iPa( Wa(2) 

From equation (8.49b), we see that the wavefunction vanishes for two identical 
fermions in the same single-particle state 

^( 1 , 2 ) = 0 

In other words, two identical fermions cannot simultaneously be in the same 
quantum state. This statement is known as the Pauli exclusion principle 
because it was first postulated by W. Pauli (1925) in order to explain the 
periodic table of the elements. 

For N identical non-interacting bosons, equation (8.32) needs to be modified 
in order for 'Ps to be normalized when some particles are in identical single¬ 
particle states. The modified expression is 

1 /2 

= ( Na!N »- • • ) 7 ]T Pxp a (l)f b {2) .. • f P (N) (8.50) 
k 2 p 

where N n indicates the number of times the state n occurs in the product of the 
single-particle wave functions. Permutations which give the same product are 
included only once in the summation on the right-hand side of equation (8.50). 
For example, for three particles, with two in state a and one in state b, the 
products ip a (\)ipa(2)fb(f>) and xp a (2.)xp a (\)xp &(3) are identical and only one is 
included in the summation. 

For N identical non-interacting fermions, equation (8.32) may also be 
expressed as a Slater determinant 

xp a (\) lpa(2) ■ ■ ■ lpa(N) 

xp A = (tfl)-l/2 V'ftO) x Pb(2) ••• xp b (N) (851) 

xpp(l) Ip p (2) ••• xpp(N) 

The expansion of this determinant is identical to equation (8.32) with 
^(l, 2, ..., N) given by (8.47). The properties of determinants are discussed 
in Appendix I. The wave function W A in equation (8.51) is clearly antisym¬ 
metric because interchanging any pair of particles is equivalent to interchan- 
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ging two columns and hence changes the sign of the determinant. Moreover, if 
any pair of particles are in the same single-particle state, then two rows of the 
Slater determinant are identical and the determinant vanishes, in agreement 
with the Pauli exclusion principle. 

Although the concept of non-interacting particles is an idealization, the 
model may be applied to real systems as an approximation when the inter¬ 
actions between particles are small. Such an approximation is often useful as a 
starting point for more extensive calculations, such as those discussed in 
Chapter 9. 


Probability densities 

The difference in behavior between bosons and fermions is clearly demon¬ 
strated by their probability densities | v l ; v| 2 and I'lfi, 2 . For a pair of non¬ 
interacting bosons, we have from equation (8.49a) 

l^l 2 = £|V'«(1)| 2 |V'*(2)| 2 + \\ipa(2)\ 2 \tp b (l)\ 2 + Re[^(l)V^(2)Va2)tMl)] 

(8.52) 


For a pair of non-interacting fermions, equation (8.49b) gives 

|^| 2 = ±|V'«(l)| 2 |V'i(2)| 2 + I|^(2)| 2 |^(1)| 2 - 

(8.53) 

The probability density for a pair of distinguishable particles with particle 1 in 
state a and particle 2 in state b is |?/> a (l)| 2 |'fy : >/,(2)| 2 . If the distinguishable 
particles are interchanged, the probability density is |r/> a (2)| 2 |V^(l)| 2 . The 
probability density for one distinguishable particle (either one) being in state a 
and the other in state b is, then 

i|t/ai)| 2 |t/h,(2)| 2 + 2 l'/’«( 2 )| ? |v’/>( | )| 2 

which appears in both l^sl 2 and |'F^| 2 . The last term on the right-hand sides 
of equations (8.52) and (8.53) arises because the particles are indistinguishable 
and this term is known as the exchange density or overlap density. Since the 
exchange density is added in 1 2 and subtracted in |TF ( | 2 , it is responsible 
for the different behavior of bosons and fermions. 

The values of l^sl 2 and | v F j| 2 when the two particles have the same 
coordinate value, say qo, so that qi = q 2 = qo, are 
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Mo = ^a(qo)\ 2 \'ipb(qo )\ 2 + p\fa(qo)\ 2 \fb(qo )\ 2 

+ (qoMt (Qo)V ; a(qo0^6(qo)] 

= 2|^ a (qo)| 2 ^fe(qo)| 2 

Mlo = 2l i /’«(qo)| 2 l'/’/.(qo )| 2 + ^«(qo)| 2 ^fe(qo )| 2 
- Re[^*(qo)v^Cqo)v^(qo)v^(qo)] 

= 0 

Thus, the two bosons have an increased probability density of being at the same 
point in space, while the two fermions have a vanishing probability density of 
being at the same point. This conclusion also applies to systems with N 
identical particles. Identical bosons (fermions) behave as though they are under 
the influence of mutually attractive (repulsive) forces. These apparent forces 
are called exchange forces, although they are not forces in the mechanical 
sense, but rather statistical results. 

The exchange density in equations (8.52) and (8.53) is important only when 
the single-particle wave functions f a ( q) and ?/>/,(q) overlap substantially. 
Suppose that the probability density |Va(q)| 2 is negligibly small except in a 
region A and that |^/>(q)| 2 is negligibly small except in a region B, which does 
not overlap with region A. The quantities ?/>*(l )?/>/, (1) and ipfflyipjl) are then 
negligibly small and the exchange density essentially vanishes. For qi in region 
A and q 2 in region B , only the first term \>p a ( 1 )[ 2 |y>/ ; (2)| 2 on the right-hand 
sides of equations (8.52) and (8.53) is important. This expression is just the 
probability density for particle 1 confined to region A and particle 2 confined 
to region B. The two particles become distinguishable by means of their 
locations and their joint wave function does not need to be made symmetric or 
antisymmetric. Thus, only particles whose probability densities overlap to a 
non-negligible extent need to be included in the symmetrization process. For 
example, electrons in a non-bonded atom and electrons within a molecule 
possess antisymmetric wave functions; electrons in neighboring atoms and 
molecules are too remote to be included. 


Electron spin and the helium atom 

We may express the single-particle wave function ipniq,) as the product of a 
spatial wave function 0„(r,) and a spin function yji). For a fermion with spin 
such as an electron, there are just two spin states, which we designate by a(i) 
for m s = j and fi(i) for m s = — Therefore, for two particles there are three 
symmetric spin wave functions 
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a(l)a(2) 

mm 

2- 1 / 2 [a(l)y3(2) + a(2) y S(l)] 
and one antisymmetric spin wave function 

2- 1 / 2 [ a(l )y S ( 2) - a(2)/3(l)] 

where the factors 2' 1/2 are normalization constants. When the spatial and spin 
wave functions are combined, there are four antisymmetric combinations: a 
singlet state (.S' = 0) 

i[0 fl (l)0*(2) + 0 fl (2)0 4 (l)][a(l)/S(2) - a(2)yS(l)] 
and three triplet states (S’ = 1) 

f a(l)a(2) 

2- 1/2 [0 a (l)0 6 (2) - 0o(2)0fe(l)]< yS(l)/8(2) 

[ 2 _1 / 2 [a(l)/3(2) + a(2)/3(l)] 

These four antisymmetric wave functions are normalized if the single-particle 
spatial wave functions 0„(r,•) are normalized. If the two fermions are in the 
same state 0 a (r ( ), then only the singlet state occurs 

2 _ 1 /2 0a(T)0o(2)[a(l)/3(2) - a(2)/3(l)] 


The helium atom serves as a simple example for the application of this 
construction. If the nucleus (for which Z = 2) is considered to be fixed in 
space, the Hamiltonian operator H for the two electrons is 


ft 2 7 p' 2 


Ze’ 2 e' 2 

-1- 

r 2 r 12 


(8.54) 


where r\ and r 2 are the distances of electrons 1 and 2 from the nucleus, r l2 is 
the distance between the two electrons, and e' = e for CGS units or 
e' = e/(4rrc 0 ) l/2 for SI units. Spin-orbit and spin-spin interactions of the 
electrons are small and have been neglected. The electron-electron interaction 
is relatively small in comparison with the interaction between an electron and a 
nucleus, so that as a crude first-order approximation the last term on the right- 
hand side of equation (8.54) may be neglected. The operator H then becomes 
the sum of two hydrogen-atom Hamiltonian operators with Z = 2. The 
corresponding single-particle states are the hydrogen-like atomic orbitals f„i m 
discussed in Section 6.4. The energy of the helium atom depends on the 
principal quantum numbers n\ and n 2 of the two electrons and is the sum of 
two hydrogen-like atomic energies with Z = 2 


E 


n\,U2 


m e Z 2 e' 4 / 1 1 \ 

2 h 2 yn 2 n\) 


-54.4 eV 
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In the ground state of helium, according to this model, the two electrons are 
in the Is orbital with opposing spins. The ground-state wave function is 

W 0 (l, 2) = 2 -1 / 2 1 s( 1) 1 s(2)[a( 1 )/?(2) - a(2)/3(l)] 

and the ground-state energy is —108.8 eV. The energy of the ground state of 
the helium ion He + , for which n\ = 1 and = oc, is —54.4 eY In Section 
9.6, we consider the contribution of the electron-electron repulsion term to the 
ground-state energy of helium and obtain more realistic values. 

Although the orbital energies for a hydrogen-like atom depend only on the 
principal quantum number n, for a multi-electron atom these orbital energies 
increase as the azimuthal quantum number / increases. The reason is that the 
electron probability density near the nucleus decreases as / increases, as shown 
in Figure 6.5. Therefore, on average, an electron with a larger l value is 
screened from the attractive force of the nucleus by the inner electrons more 
than an electron with a smaller l value, thereby increasing its energy. Thus, the 
2s orbital has a lower energy than the 2p orbitals. 

Following this argument, in the first- and second-excited states, the electrons 
are placed in the Is and 2s orbitals. The antisymmetric spatial wave function 
has the lower energy, so that the first-excited state ^(l, 2) is a triplet state, 

f a(l)a(2) 

WK1, 2) = 2 -1 / 2 [1s(1)2s( 2) - 1 s(2)2s( 1)] < y3(l)j3(2) 

[ 2~ 1 / 2 [a(l)/3(2) + a(2)/3(l)] 

and the second-excited state x T f 2 ( 1, 2) is a singlet state 

W 2 (l, 2) = i[ls(l)2s(2) + 1 s(2)2s( 1)][a( 1 )/3(2) - a(2)/3(l)] 

Similar constructions apply to higher excited states. The triplet states are called 
orthohelium, while the singlet states are called parahelium. For a given pair of 
atomic orbitals, the orthohelium has the lower energy. In constructing these 
excited states, we place one of the electrons in the Is atomic orbital and the 
other in an excited atomic orbital. If both electrons were placed in excited 
orbitals (ri\ >2, n 2 5 s 2), the resulting energy would be equal to or greater 
than —27.2 eY which is greater than the energy of He + , and the atom would 
ionize. 

This same procedure may be used to explain, in a qualitative way, the 
chemical behavior of the elements in the periodic table. The application of the 
Pauli exclusion principle to the ground states of multi-electron atoms is 
discussed in great detail in most elementary textbooks on the principles of 
chemistry and, therefore, is not repeated here. 
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8.5 The free-electron gas 

The concept of non-interacting fermions may be applied to electrons in a metal. 
A metal consists of an ordered three-dimensional array of atoms in which some 
of the valence electrons are so weakly bound to their parent atoms that they 
form an ‘electron gas’. These mobile electrons then move in the Coulombic 
field produced by the array of ionized atoms. In addition, the mobile electrons 
repel each other according to Coulomb’s law. For a given mobile electron, its 
Coulombic interactions with the ions and the other mobile electrons are long- 
ranged and are relatively constant over the range of the electron’s position. 
Consequently, as a first-order approximation, the mobile electrons may be 
treated as a gas of identical non-interacting fermions in a constant potential 
energy field. 

The free-electron gas was first applied to a metal by A. Sommerfeld (1928) 
and this application is also known as the Sommerfeld model. Although the 
model does not give results that are in quantitative agreement with experi¬ 
ments, it does predict the qualitative behavior of the electronic contribution to 
the heat capacity, electrical and thermal conductivity, and thermionic emission. 
The reason for the success of this model is that the quantum effects due to the 
antisymmetric character of the electronic wave function are very large and 
dominate the effects of the Coulombic interactions. 

Each of the electrons in the free-electron gas may be regarded as a particle 
in a three-dimensional box, as discussed in Section 2.8. Energies may be 
defined relative to the constant potential energy field due to the electron-ion 
and electron-electron interactions in the metallic crystal, so that we may 
arbitrarily set this potential energy equal to zero without loss of generality. 
Since the mobile electrons are not allowed to leave the metal, the potential 
energy outside the metal is infinite. For simplicity, we assume that the metallic 
crystal is a cube of volume u with sides of length a, so that v = a 2 . As given 
by equations (2.82) and (2.83), the single-particle wave functions and energy 
levels are 




y 


,«z 



riyJtx . n v 7iy . 
—— sin ——— sin 
a a 


n : Jtz 

a 


(8.55) 


h 2 


8 m e a 2 


(n 2 x + n 2 + n 2 z ) 


(8.56) 


where m c is the electronic mass and the quantum numbers n x , n y , n z have 
values n x , n v , n z = 1, 2, 3, ... . 

We next consider a three-dimensional cartesian space with axes n x , n y , n z . 
Each point in this n-space with positive (but non-zero) integer values of n x , n y , 
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and n : corresponds to a single-particle state ipn x ,n y ,n z - These points all lie in the 
positive octant of this space. If we divide the octant into unit cubic cells, every 
point representing a single-particle state lies at the corner of one of these unit 
cells. Accordingly, we may associate a volume of unit size with each single¬ 
particle state. Equation (8.56) may be rewritten in the form 

lm c a 2 E 


n 2 x + n 2 y +n 2 z = 


h 2 


which we recognize as the equation in w-space of a sphere with radius R equal 
to ^/Sm e a 2 E/ h 2 . The number.//'’( E) of single-particle states with energy less 
than or equal to E is then the volume of the octant of a sphere of radius R 

2 cA 3 /2 

8 m e a E\ 


■SV\E) = 


1 4ji 

sT 


= 


71 


h 2 


- 


4jtv 

3F 


(2 m e Ef' 2 


(8.57) 


The number of single-particle states with energies between E and E + dE is 
oj(E) dE, where «>(E) is the density of single-particle states and is related to 
JE{E) by 


oj(E) 


d i (/;> 
d E 




(8.58) 


According to the Pauli exclusion principle, no more than two electrons, one 
spin up, the other spin down, can have the same set of quantum numbers n x , n y , 
n z . At a temperature of absolute zero, two electrons can be in the ground state 
with energy 3/? 2 /Sm e a 2 , two in each of the three states with energy 6h 2 /8m e a 2 , 
two in each of the three states with energy 9h 2 /Sm e a 2 , etc. The states with the 
lowest energies are filled, each with two electrons, until the spherical octant in 
w-space is filled up to a value E F , which is called the Fermi energy. If there are 
N electrons in the free-electron gas, then we have 

8 JIV , n 

N = 2/P (E f ) = — (2m c E F f 2 (8.59) 


or 


E f 


h 2 f5N\ 2/3 
8 m e \tcv ) 


(8.60) 


where equation (8.57) has been used. The Fermi energy is dependent on the 
density N/v of the free-electron gas, but not on the size of the metallic crystal. 
The total energy E tot of the N particles is given by 


Et ot — 2 


e ¥ 


Eo>( E) d E 


(8.61) 


Jo 

where the factor 2 in front of the integral arises because each single-particle 
state is doubly occupied. Substitution of equation (8.58) into (8.61) gives 
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£, o ,=^( 2 m ,) J/2 £ F s/2 

which may be simplified to 

Etot = ^ NEp 

The average energy E per electron is, then 



(8.62) 

(8.63) 


Equations (8.57) and (8.58) are valid only for values of E sufficiently large 
and for energy levels sufficiently close together that E can be treated as a 
continuous variable. For a metallic crystal of volume 1 cm 3 , the lowest energy 
level is about 10 14 eY and the spacing between levels is likewise of the order 
of 1(T 14 eV Since metals typically possess about 10 22 to 10 23 free electrons 
per cm 3 , the Fermi energy Ep is about 1.5 to 8 eV and the average energy E per 
electron is about 1 to 5 eV Thus, for all practical purposes, the energy of the 
lowest level may be taken as zero and the energy values may be treated as 
continuous. 

The smooth surface of the spherical octant in »-space which defines the 
Fermi energy cuts through some of the unit cubic cells that represent single¬ 
particle states. The replacement of what should be a ragged surface by a 
smooth surface results in a negligible difference because the density of single¬ 
particle states near the Fermi energy Ep is so large that E is essentially 
continuous. At the Fermi energy Ep, the density of single-particle states is 


2jtvm e ( 3 N\ 


1/3 


(8.64) 


which typically is about 10 22 to 10 23 states per eV Thus, near the Fermi energy 
Ep, a differential energy range d E of 10 10 eV contains about 10 11 to 10 12 
doubly occupied single-particle states. 

Since the potential energy of the electrons in the free-electron gas is assumed 
to be zero, all the energy of the mobile electrons is kinetic. The electron 
velocity tip at the Fermi level Ep is given by 

\m e Up = Ep (8.65) 

and the average electron velocity 77 is given by 

\m e u 2 = E = jEp (8.66) 

For electrons in a metal, these velocities are on the order of 10 s cm s _1 . 

The Fermi temperature Tp is defined by the relation 


Ep — k^Tp 


( 8 . 67 ) 
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where k b is Boltzmann’s constant, and typically ranges from 18 000K to 
90 000 K for metals. At temperatures up to the melting temperature, we have 
the relationship 

A'[j T Ey 

Thus, even at temperatures well above absolute zero, the electrons are 
essentially all in the lowest possible energy states. As a result, the electronic 
heat capacity at constant volume, which equals dE tot /dT, is small at ordinary 
temperatures and approaches zero at low temperatures. 

The free-electron gas exerts a pressure on the walls of the infinite potential 
well in which it is contained. If the volume u of the gas is increased slightly by 
an amount do, then the energy levels E nxnr „ z in equation (8.56) decrease 
slightly and consequently the Fermi energy Ey in equation (8.60) and the total 
energy E tot in (8.62) also decrease. The change in total energy of the gas is 
equal to the work — Pdv done on the gas by the surroundings, where P is the 
pressure of the gas. Thus, we have 

D dtftot 3Ad£ F 2NEy 2E to t „ 

dt; 5 dt; 5v 3v 

where equations (8.60) and (8.62) have been used. For a typical metal, the 
pressure P is of the order of 10 6 atm. 


8.6 Bose-Einstein condensation 

The behavior of a system of identical bosons is in sharp contrast to that for 
fermions. At low temperatures, non-interacting fermions of spin s fill the 
single-particle states with the lowest energies, 2s + 1 particles in each state. 
Non-interacting bosons, on the other hand, have no restrictions on the number 
of particles that can occupy any given single-particle state. Therefore, at 
extremely low temperatures, all of the bosons drop into the ground single¬ 
particle state. This phenomenon is known as Bose-Einstein condensation. 

Although A. Einstein predicted this type of behavior in 1924, only recently 
has Bose-Einstein condensation for weakly interacting bosons been observed 
experimentally. In one study, 2 a cloud of rubidium-87 atoms was cooled to a 
temperature of 170 X 10 9 K (170 nK), at which some of the atoms began to 
condense into the single-particle ground state. The condensation continued as 
the temperature was lowered to 20 nK, finally giving about 2000 atoms in the 
ground state. In other studies, small gaseous samples of sodium atoms 2 3 and of 

2 M. H. Anderson, J. R. Ensher, M. R. Matthews, C. E. Wieman, and E. A. Cornell (1995) Science 269, 198. 

3 K. B. Davis, M.-O. Mewes, M. R. Andrews, N. J. van Druten, D. S. Durfee, D. M. Kurn, and W. Ketterle 
(1995) Phys. Rev. Lett. 75, 3969. 



230 Systems of identical particles 

lithium-7 atoms 4-5 have also been cooled sufficiently to undergo Bose-Einstein 
condensation. 

Although we have explained Bose-Einstein condensation as a characteristic 
of an ideal or nearly ideal gas, i.e., a system of non-interacting or weakly 
interacting particles, systems of strongly interacting bosons also undergo 
similar transitions. Liquid helium-4, as an example, has a phase transition at 
2.18 K and below that temperature exhibits very unusual behavior. The proper¬ 
ties of helium-4 at and near this phase transition correlate with those of an ideal 
Bose-Einstein gas at and near its condensation temperature. Although the 
actual behavior of helium-4 is due to a combination of the effects of quantum 
statistics and interparticle forces, its qualitative behavior is related to Bose- 
Einstein condensation. 


Problems 

8.1 Show that the exchange operators P in equation (8.4) and P a p in (8.20) are 
hermitian. 

8.2 Noting from equation (8.10) that 

V(l,2) = 2- 1/2 ('P S + Tfr) 

'h(2, l)-2- 1 / 2 ('P s -^) 

show that W(l, 2) and l h(2, 1) are orthogonal if 'Ifs and *h | are normalized. 

8.3 Verify the validity of the relationships in equation (8.19). 

8.4 Verify the validity of the relationships in equation (8.22). 

8.5 Apply equation (8.12) to show that 'b\ and 'I' / in (8.26) are normalized. 

8.6 Consider two identical non-interacting particles, each of mass m, in a one¬ 
dimensional box of length a. Suppose that they are in the same spin state so that 
spin may be ignored. 

(a) What are the four lowest energy levels, their degeneracies, and their corre¬ 
sponding wave functions if the two particles are distinguishable? 

(b) What are the four lowest energy levels, their degeneracies, and their corre¬ 
sponding wave functions if the two particles are identical fermions? 

(c) What are the four lowest energy levels, their degeneracies, and their corre¬ 
sponding wave functions if the two particles are identical bosons? 

8.7 Consider a crude approximation to the ground state of the lithium atom in which 
the electron-electron repulsions are neglected. Construct the ground-state wave 
function in terms of the hydrogen-like atomic orbitals. 


4 C. C. Bradley, C. A. Sackett, J. J. Tollett, and R. G. Hulet (1995) Phvs. Rev. Lett. 75, 1687. 

5 C. C. Bradley, C. A. Sackett, and R. G. Hulet (1997) Phys. Rev. Lett. 78, 985. 
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8.8 The atomic weight of silver is 107.9 gmol -1 and its density is 10.49 gcm~ 3 . 
Assuming that each silver atom has one conduction electron, calculate 

(a) the Fermi energy and the average electronic energy (in joules and in eV), 

(b) the average electronic velocity, 

(c) the Fermi temperature, 

(d) the pressure of the electron gas. 

8.9 The bulk modulus or modulus of compression B is defined by 



Show that B for a free-electron gas is given by B — 5P/3. 
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In the preceding chapters we solved the time-independent Schrodinger equation 
for a few one-particle and pseudo-one-particle systems: the particle in a box, 
the harmonic oscillator, the particle with orbital angular momentum, and the 
hydrogen-like atom. There are other one-particle systems, however, for which 
the Schrodinger equation cannot be solved exactly. Moreover, exact solutions 
of the Schrodinger equation cannot be obtained for any system consisting of 
two or more particles if there is a potential energy of interaction between the 
particles. Such systems include all atoms except hydrogen, all molecules, non¬ 
ideal gases, liquids, and solids. For this reason we need to develop approxima¬ 
tion methods to solve the Schrodinger equation with sufficient accuracy to 
explain and predict the properties of these more complicated systems. Two of 
these approximation methods are the variation method and perturbation 
theory. These two methods are developed and illustrated in this chapter. 


9.1 Variation method 

Variation theorem 

The variation method gives an approximation to the ground-state energy Eq 
(the lowest eigenvalue of the Hamiltonian operator H) for a system whose 
time-independent Schrodinger equation is 

Hxp n = E n xp n , n = 0,1,2,... (9.1) 

In many applications of quantum mechanics to chemical systems, a knowledge 
of the ground-state energy is sufficient. The method is based on the variation 
theorem : if 0 is any normalized, well-behaved function of the same variables 
as xp n and satisfies the same boundary conditions as y> „, then the quantity 
(S = (0| H\(p) is always greater than or equal to the ground-state energy E 0 

gr=(0|A|0) ^E 0 (9.2) 
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Except for the restrictions stated above, the function 0, called the trial 
function, is completely arbitrary. If 0 is identical with the ground-state eigen¬ 
function 0o, then of course the quantity W equals E 0 . If 0 is one of the 
excited-state eigenfunctions, then S’ is equal to the corresponding excited-state 
energy and is obviously greater than E 0 . However, no matter what trial function 
0 is selected, the quantity <§ is never less than E {) . 

To prove the variation theorem, we assume that the eigenfunctions xp n form 
a complete, orthonormal set and expand the trial function 0 in terms of that set 


0 = ^2 a,, V" 

n 

where, according to equation (3.28) 

a n = (0„|0) 

Since the trial function 0 is normalized, we have 

«■« = (£ dkfk n{fk\fn) 

\ k n k n 


(9.3) 

(9.4) 


^ ^ y ^ ^ k ® n & kn y ^ I @ n | 1 

k n n 

We next substitute equation (9.3) into the integral for in (9.2) and subtract 
the ground-state energy Eo , giving 

& - Eo = (cp\H - E 0 \(p) = 'E'E a *k a n{fk\H - Efxp n ) 

k n 


= a *k a ni E n - Eo)(lpk\f„) = E, I a n\ 2 ( E n ~ E o) (9.5) 

k n n 

where equation (9.1) has been used. Since E„ is greater than or equal to E 0 and 
|«„ | 2 is always positive or zero, we have S’ — E 0 ^ 0 and the theorem is 
proved. 

In the event that 0 is not normalized, then 0 in equation (9.2) is replaced by 
A(p, where A is the normalization constant, and this equation becomes 

gr = |h| 2 <0|//|0) ^ e 0 

The normalization relation is 

(A<p\A<p) = M| 2 (0|0) = 1 


giving 


< 0 |//| 0 > 

6 


E o 


(9.6) 


In practice, the trial function 0 is chosen with a number of parameters X\, 
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X 2 , • • •, which can be varied. The quantity & is then a function of these 
parameters: 'ff(X\, X 2 , ...). For each set of parameter values, the corresponding 
value of %(X U X 2 , ...) is always greater than or equal to the true ground-state 
energy E 0 . The value of X 2 , . . .) closest to E 0 is obtained, therefore, by 
minimizing <% with respect to each of these parameters. Selecting a sufficiently 
large number of parameters in a well-chosen analytical form for the trial 
function 0 yields an approximation very close to E 0 . 

Ground-state eigenfunction 

If the quantity <£ is identical to the ground-state energy E 0 , which is usually 
non-degenerate, then the trial function 0 is identical to the ground-state 
eigenfunction 0 O . This identity follows from equation (9.5), which becomes 

^ \a n \ 2 (E n - E 0 ) = 0 

«(¥= 0 ) 

where the term for n = 0 vanishes because E n — E {) vanishes. This relationship 
is valid only if each coefficient a n equals zero for n f 0. From equation (9.3), 
the normalized trial function 0 is then equal to 0o. Should the ground-state 
energy be degenerate, then the function 0 is identical to one of the ground-state 
eigenfunctions. 

When the quantity is not identical to Eq, we assume that the trial function 
0 which minimizes <% is an approximation to the ground-state eigenfunction 
0o ■ However, in general, W is a closer approximation to E {) than 0 is to 0 O . 


Example: particle in a box 

As a simple application of the variation method to determine the ground-state 
energy, we consider a particle in a one-dimensional box. The Schrodinger 
equation for this system and its exact solution are presented in Section 2.5. The 
ground-state eigenfunction is shown in Figure 2.2 and is observed to have no 
nodes and to vanish at x = 0 and x = a. As a trial function 0 we select 
0 = x(a — x), 0 «£ x a 

= 0, x < 0, x > a 


which has these same properties. Since we have 


( 0 | 0 ) = 


x 2 {a -x) 2 dx = ^~ 
0 4U 


the normalized trial function is 

= ^ x ( a ~ x ^ 


0 *£ x s£ a 


0 
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The quantity <% is, then 

^ 30 ' a ( — f d 2 \ 

<% = {(p\H\<p) = —r (ax — x 2 ) —-— ) (ax — x 2 ) dx 

a 5 J 0 \ 2m dx-/ 



The exact ground-state energy E\ is shown in equation (2.39) to be 
7i 2 h 2 /2ma 2 . Thus, we have 

% = l -^E x = 1.013iii > E\ 

jt- 


giving a 1.3% error. 


Example: harmonic oscillator 

We next consider an example with a variable parameter. For the harmonic 
oscillator, discussed in Chapter 4, we select 

0 = e _c * 2 

as the trial function, where c is a parameter to be varied so as to minimize 
W(c). This function has no nodes and approaches zero in the limits x —>■ ±oo. 
Since the integral (0|0) is 

(•0° / _ \ 1/2 

< ^ > =L e ' j “ d, '=y 

where equation (A. 5) is used, the normalized trial function is 



The Hamiltonian operator H for the harmonic oscillator is given in equation 
(4.12). The quantity 'S(c) is then determined as follows 



h 2 c mar 
2m 8c 
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where equations (A.5) and (A.7) have been used. 

To find the minimum value of we set the derivative dA/dc equal to 
zero and obtain 


dt? h 2 mco 2 
d c 2m 8c 2 

so that 


moo 

C = H 

We have taken the positive square root because the parameter c must be 
positive for 0 to be well-behaved. The best estimate of the ground-state energy 
is then 


% 


OLi 

( ma>\ 

ma) 2 - 

( 2 h' 

2 m ' 

V 2 h ) 

8 

\mct) 


hto 

~Y 


which is the exact result. 

The reason why we obtain the exact ground-state energy in this simple 
example is that the trial function 0 has the same mathematical form as the 
exact ground-state eigenfunction, given by equation (4.39). When the para¬ 
meter c is evaluated to give a minimum value for W , the function 0 becomes 
identical to the exact eigenfunction. 


Excited-state energies 

The variation theorem may be extended in some cases to estimate the energies 
of excited states. Under special circumstances it may be possible to select a 
trial function 0 for which the first few coefficients in the expansion (9.3) 
vanish: ao = a\ = ■ • • = a*-i = 0, in which case we have 


and 


0 = 'Yh Un ^ n 

n(5*k) 


\a n I 2 = 1 

n(^k) 

We assume here that the eigenfunctions ip n in equation (9.1) are labeled in 
order of increasing energy, so that 

Eq E\ E 2 • • • 

Following the same procedure used to prove the variation theorem, we 
obtain 
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% ~ E k = ^ \a n \\E n - E k ) 

n(^k) 

from which it follows that 

^ & E k (9.7) 

Thus, the quantity S is an upper bound to the energy E k corresponding to the 
state ip k . For situations in which (p can be made orthogonal to each exact 
eigenfunction ip 0 , ip\, ..., ip k -i, the coefficients ao, «i, . a k - i vanish 
according to equation (9.4) and the inequality (9.7) applies. 

An example is a one-dimensional system for which the potential energy V(x) 
is an even function of the position variable x. The eigenfunction ip o with the 
lowest eigenvalue E {) has no nodes and therefore must be an even function of x. 
The eigenfunction xp\ has one node, located at the origin, and therefore must 
be an odd function of x. If we select for cp any odd function of x, then (p is 
orthogonal to any even function of x, including ip 0 , and the coefficient ao 
vanishes. Thus, the integral S’ = {<p\H\<p) gives an upper bound to E\ even 
though the ground-state eigenfunction ipo may not be known. 

When the exact eigenfunctions ip 0 , ip x , ..., ip k -i are not known, they may 
be approximated by trial functions <po, (pi, ■ ■ ., <p k ~\ which successively give 
upper bounds for E 0 , Ei, , E k _ i, respectively. In this case, the function (p\ 
is constructed to be orthogonal to (p 0 , <p 2 constructed orthogonal to both (p 0 and 
(pi, and so forth. In general, this method is difficult to apply and gives 
increasingly less accurate results with increasing n. 


9.2 Linear variation functions 

A convenient and widely used form for the trial function (p is the linear 
variation function 

N 

<P = ^2 c iXi (9-8) 

i= 1 

where %i, X 2 , ■ ■ ■, Xn are an incomplete set of linearly independent functions 
which have the same variables and which satisfy the same boundary conditions 
as the exact eigenfunctions ip n of equation (9.1). The functions %, are selected 
to be real and are not necessarily orthogonal to one another. Thus, the overlap 
integral Sy, defined as 

Sy (Xi\Xj) (9.9) 

is not generally equal to dy. The coefficients c, are also restricted to real values 
and are variation parameters to be determined by the minimization of the 
variation integral S’. 
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If we substitute equation (9.8) into (9.6) and define //,, by 

Hy = C Xi\H\%j) 


(9.10) 


we obtain 


W = 


N N 

y y c i c j H 'j 

'=i j=i 

N N 

y y c ‘ c i s ‘i 

1= 1 7=1 


or 


N N N N 

t £ e c i c jSij = e e 

1=1 7=1 1=1 7=1 


(9.11) 


To find the values of the parameters c, in equation (9.8) which minimize W, 
we differentiate equation (9.11) with respect to each coefficient c k (k = 1, 2, 

■■■, N) 


N N 


^££ etc# 


'• +y 


N N 

y y c ‘ c j s v 

1=1 7=1 


_d_ 

dc k 


N N 

y y cic J H v 

1=1 7=1 


and set (OS’ / dc k ) = 0 for each value of k. The first term on the left-hand side 
vanishes. The remaining two terms may be combined to give 


( N N \ N N / Q n \ 

E E c ‘°a H « - j =E|k tJ+t ^) (H: 


d_ 
dc k 


v - ffty) 

£ £(<W, + CidjkXHij - r^ 7 ) 


V N 


1=1 7=1 
N 


N 


y Cj(H kj - %S kj ) + d(H ik - WS ik ) 

7=1 


;=1 


= 0 


where we have noted that (dc,/dc k ) = d ik because the coefficients c, in equa¬ 
tion (9.8) are independent of each other. If we replace the dummy index j by i 
and note that H ik = H ki and S ik = S ki because the functions are real, we 
obtain a set of N linear homogeneous simultaneous equations 

N 

y Ci(H ki - WS ki ) = 0, 

/=l 


k= 1, 2, ..., N 


(9.12) 



9.3 Non-degenerate perturbation theoiy 


239 


Equation (9.12) has the form 


N 


^ a ki Xi = 0 , 


k= 1, 2, N 


(9.13) 


i=i 


an 

A12 

fllAT 

«21 

U22 

U2N 

1 

®N2 ■ ' 

’ ' ®NN 


for which a trivial solution is x,- = 0 for all i. A non-trivial solution exists if, 
and only if, the determinant of the coefficients ay vanishes 


= 0 


This determinant or its equivalent algebraic expansion is known as the secular 
equation. In equation (9.12) the parameters c, correspond to the unknown 
quantities x, in equation (9.13) and the terms (Hy — S’Sy) correspond to the 
coefficients ay. Thus, a non-trivial solution for the N parameters c t exists only 
if the determinant with elements ( Hy — WSy) vanishes 

Hu — S’Sn H\2 (?S\2 ••• Hin — (?Sin 
H 21 — %S 2 \ H 22 — &S 22 ' ' ' H 2 N — W$2N 


H mn — (SS 


NN 


= 0 


(9.14) 


H N\ — H N 2 — %Sn2 ■■ 

The secular equation (9.14) is satisfied only for certain values of <%. Since 
this equation is of degree N in A', there are N real roots 

gr 0 *£ ^ ^_i 

According to the variation theorem, the lowest root g’o is an upper bound to the 
ground-state energy Eq\ Eq S’o. The other roots may be shown 1 to be upper 
bounds for the excited-state energy levels 

E\ f?i, E 2 =£ .. ■, E n _ x <§n-i 


9.3 Non-degenerate perturbation theory 


Perturbation theory provides a procedure for finding approximate solutions to 
the Schrddinger equation for a system which differs only slightly from a system 
for which the solutions are known. The Hamiltonian operator H for the system 
of interest is given by 


H= - 



J. K. L. MacDonald (1933) Phys. Rev. 43, 830. 


1 
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where N is the number of particles in the system. We suppose that the 
Schrodinger equation for this Hamiltonian operator 

Hxp n = E n xp n (9.15) 

cannot be solved exactly by known mathematical techniques. 

In perturbation theory we assume that H may be expanded in terms of a 
small parameter 2 

H = H {0) + XH a) + X 2 H (2) + ■ • • = H {0) + H' (9.16) 

where 

H' = XH (l) +X 2 H a) + (9.17) 

The quantity H i0) is the unperturbed Hamiltonian operator whose orthonormal 
eigenfunctions (/2 0) and eigenvalues E { ^ are known exactly, so that 

tf (0 y„ 0) = pW./h (9.18) 

The operator H' is called the perturbation and is small. Thus, the operator II 
differs only slightly from 27 <0) and the eigenfunctions and eigenvalues of II do 
not differ greatly from those of the unperturbed Hamiltonian operator 27 <0) . 
The parameter X is introduced to facilitate the comparison of the orders of 
magnitude of various terms. In the limit 2 —> 0, the perturbed system reduces 
to the unperturbed system. For many systems there are no terms in the 
perturbed Hamiltonian operator higher than II {1} and for convenience the 
parameter X in equations (9.16) and (9.17) may then be set equal to unity. 

The mathematical procedure that we present here for solving equation (9.15) 
is known as Rayleigh-Schrodinger perturbation theory. There are other 
procedures, but they are seldom used. In the Rayleigh-Schrodinger method, 
the eigenfunctions xp n and the eigenvalues E n are expanded as power series 
in X 

rp n = < } + + X 2 ip (2) + • • • (9.19) 

E„ = y o) + 2£ < „ 1) + X 2 E (2) + ■ ■ • (9.20) 

The quantities xp^ and E\] 1 are the first-order corrections to \p„ and E n , the 
quantities ip (2) and E (2) are the second-order corrections, and so forth. If the 
perturbation H' is small, then equations (9.19) and (9.20) converge rapidly for 
all values of X where 0 =£ 2 =£ l. 

We next substitute the expansions (9.16), (9.19), and (9.20) into equation 
(9.15) and collect coefficients of like powers of 2 to obtain 

iyy„ 0 ) +2(iy y„°> + +2 2 (^ 2 yf + + ^°y„ 2) ) + ■ ■ ■ 

= + 2(4 i y„ 0) + E^) + X\E^ + + E^) +.. . 

(9.21) 
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This equation has the form 


/(e) = bk £k = 0 

k 


where 


bk 


]_(&A 

k\ WA=o 


Since /(e) is identically zero, the coefficients bk all vanish. Thus, the coeffi¬ 
cients of X k on the left-hand side of equation (9.21) are equal to the coefficients 
of X k on the right-hand side. The coefficients of 2° give equation (9.18) for the 
unperturbed system. The coefficients of X yield 

(. H (0) - E - E^)xpf (9.22) 

while the coefficients of X 2 give 

(H (0) - 4 0) )/ ( „ 2) + (H {1) - 4 1) )/ ( „ 1) = -(f/ (2) - E^)xpf (9.23) 
and so forth. 


First-order corrections 

To find the first-order correction Ety to the eigenvalue E n , we multiply 
equation (9.22) by the complex conjugate of / ( „ 0) and integrate over all space to 
obtain 

(€W 0) \^) - e?(€ } l€ } ) = + 4 1} 

where we have noted that ?/> ( n ni is normalized. Since // (0) is hermitian, the first 
integral on the left-hand side takes the form 

= < ) << ) i< i, > 

and therefore cancels the second integral on the left-hand side. The first-order 
correction E \] 1 is, then, the expectation value of the perturbation // (l) in the 
unperturbed state 

E ( ? = (VT\H W \VT) = (9.24) 

The first-order correction to the eigenfunction is obtained by multi¬ 
plying equation (9.22) by the complex conjugate of / ( / ° ) for k / n and 
integrating over all space to give 

<<’l *<%<!>) - 4 C, V><'>> = -«’l mv?) + £< 1) «VS ,) > 

(9.25) 

Noting that the unperturbed eigenfunctions are orthogonal 

(V'flV'f) = d kn 


(9.26) 
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applying the hermitian property of H (0) to the first term on the left-hand side, 
and writing for (?/ ,< / J )) | // <l | |?/) ( rt n) ), we may express equation (9.25) as 

(4 0) - (9.27) 

The orthonormal eigenfunctions ?// 0) for the unperturbed system are as¬ 
sumed to form a complete set. Thus, the perturbation corrections tp^ ] may be 
expanded in terms of the set v// 01 

V4 1} = Y an J^T = a 'y^T 

j j&n) 

where a„j are complex constants given by 

a >j = {vf\ H> ( n) (9.28) 

If the complete set of eigenfunctions for the unperturbed system includes a 
continuous range of functions, then the expansion of ip\] ] must include these 
functions. The inclusion of this continuous range is implied in the summation 
notation. The total eigenfunction ip n for the perturbed system to first order in X 
is, then 

tyn = (1 + Xa nn )xp ( ^ +X Y a nMJ ] (9-29) 

j(¥=n) 

Since the function xp ( ;; 0) is already included in zero order in the expansion of 
xp„, we may, without loss of generality, set a nn equal to zero, so that 

v4° = Y a ^T (930) 

j& «) 

This choice affects the normalization constant of xp n , but has no other 
consequence. Furthermore, equation (9.28) for j = n becomes 

«W> = 0 (9.31) 

showing that with a„„ = 0, the first-order correction tp^ is orthogonal to the 
unperturbed eigenfunction xp^\ 

With the choice a nn = 0, the total eigenfunction xp„ to first order is normal¬ 
ized. To show this, we form the scalar product {ip n \tpn) using equation (9.29) 
and retain only zero-order and first-order terms to obtain 

(v„h,) = (€ ] \€ > )+* X>„.<v TW?) + <M?\€ ) )) 

j(¥=n) 

— ^ T X ^ ~ ( ti n j T" a nj )d n j — 1 
j&n) 

where equation (9.26) has been used. 

Substitution of equation (9.30) into (9.27) gives 
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- £?) E ‘MV’tVf) = (4 01 - 

Mn) 

where again equation (9.26) is utilized. If the eigenvalue Ep ) is non-degen¬ 
erate, then Ep } cannot equal Ep } for all k and n and we can divide by 
(E^ — E ( fP) to solve for a„k 



The situation where E^ ) is degenerate requires a more complex treatment, 
which is presented in Section 9.5. The first-order correction ip\P is obtained by 
combining equations (9.30) and (9.32) 


= - E 


/ 17(0) / 7 ( 0 ) * k 

k{^ n)^ k ^ n 


(9.33) 


Second-order corrections 

The second-order correction Ep ] to the eigenvalue E n is obtained by multi¬ 
plying equation (9.23) by ypp and integrating over all space 

(€W 0) - 4 0) lv4 2) > + <va#‘W> - 

= -(<W 2 W> + 4 2) 

where the normalization of tj)^ ) has been noted. Application of the hermitian 
property of II {<)> cancels the first term on the left-hand side. The third term on 
the left-hand side vanishes according to equation (9.31). Writing Hpp for 
(ypP\ /i i2) \yp l )} ) and substituting equation (9.33) then give 


= i-i {2) - y 

nn / j 


fr (D frd) 

E n nk 11 kn 

r(0) p(0) 

k(^n) h k ~ h n 


= h ( v - y 

nn / v 


' rCO) r(0) 
k{^n) h k ~ h n 


(9.34) 


where we have also noted that fl\p equals fl)p because H (]) is hermitian. 

In many applications there is no second-order term in the perturbed 


Hamiltonian operator so that Hpl is zero. In such cases each unperturbed 
eigenvalue E^p is raised by the terms in the summation corresponding to 
eigenvalues Epp less than Epp and lowered by the terms with eigenvalues EpP 
greater than E^p. The eigenvalue Epp is perturbed to the greatest extent by the 
terms with eigenvalues Ep* close to Ep ] . The contribution to the second-order 
correction Epp of terms with eigenvalues far removed from Ep ] is small. For 
the lowest eigenvalue Ep ] , all of the terms are negative so that Ep ] is negative. 
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We also see that, in these cases, the first-order correction ?//J * to the eigenfunc¬ 
tion determines the second-order correction E { p to the eigenvalue. 

To obtain the second-order perturbation correction ip (2) to the eigenfunction, 
we multiply equation (9.23) by y/j^ for k / n and integrate over all space 

«’l H W> - ^¥< 2) > + (V?\H a WP) - £<„V?W’> 

=+ ^(vTlvT) 

As before, we apply the hermitian property of // (0) , introduce the abbreviation 
and use the orthogonality relation (9.26) to obtain 

(Ef - E^)(ipf\xpf) + - £<: ) <<y,! , > = -H% (9.35) 

We next expand the function y/^ } in terms of the complete set of unperturbed 
eigenfunctions yy 11 

¥? = b ^T ( 9 - 36 ) 

j&n) 

where, without loss of generality, the term j = n may be omitted for the same 
reason that y/^ is omitted in equation (9.30). The coefficients b n/ are complex 
constants given by 

b nj = V„ 2) > (9.37) 

Substitution of equations (9.24), (9.28), (9.30), and (9.37) into (9.35) gives 

(4 01 - £<>„* + V a Mj H " 1 - = -tf® 

j&n) 

or 


H 


bnk 


+ E 

j&n) 


a nj^kj a nkH^rm 


(ET - E »>) 


(9.38) 


Combining equations (9.32), (9.36), and (9.38), we obtain the final result 


■/>'? = E 

K+n) 


— H 


( 2 ) 

kn 




+ E 

j&n) 


M kj n jn 


MM 


(Ef - E^){Ef - Ef) (4 0) - isf) 2 




(9.39) 


Summary 

The non-degenerate eigenvalue E„ for the perturbed system to second order is 
obtained by substituting equations (9.24) and (9.34) into (9.20) to give 
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HP-) _ V' 

nn / , 


//' 


(1)|2 
kn I 




£■( 0 ) _ £-( 0 ) 


(9.40) 


The corresponding eigenfunction xp„ to second order is obtained by combining 
equations (9.19), (9.33), and (9.39) 


V n ~ V'f “A 




( 1 ) 


^(0) ^(0) ^ ^ 
kHti) h k ~ h n 


(0) 


+ 1 2 

k&n) 


— H 


( 2 ) 

kn 


+ E 


H a) H a) 

n kj n jn 


H (l) 

^ kn ** nn 


4 0) - 4 0) jfe) (4 0) - 4 0) )(^ 0) - 4 0) ) (4 0) - e ^) 1 


v4 0) 


(9.41) 


While the eigenvalue E {{ ^ for the unperturbed system must be non-degen¬ 
erate for these expansions to be valid, some or all of the other eigenvalues E^ 
for k ^ n may be degenerate. The summations in equations (9.40) and (9.41) 
are to be taken over all states of the unperturbed system other than the state 
V ( „ 0) . If an eigenvalue Ef ] is g r fold degenerate, then it is included g ; - times in 
the summations. If the unperturbed eigenfunctions have a continuous range, 
then the summations in equations (9.40) and (9.41) must include an integration 
over those states as well. 


Relation to variation method 

If we use the wave function y/ i i ) ' n for the unperturbed ground state as a trial 
function 0 in the variation method of Section 9.1 and set H equal to 
H W) + XH ({ \ then we have from equations (9.2), (9.18), and (9.24) 

2T = <0|&|0> = (Vo 0) |iT (0) + 2iT (1, |0Q O> ) = 4 0) +XE ( 0 l) 

and & is equal to the first-order energy as determined by perturbation theory. If 
we instead use a trial function 0 which contains some parameters and which 
equals 0 q O) for some set of parameter values, then the corresponding energy 
from equation (9.2) is at least as good an approximation as E 1 ^ + aE\I 1 to the 
true ground-state energy. 

Moreover, if the wave function v/ , J ) <)) + hj)\l 1 is used as a trial function 0, then 
the quantity from equation (9.2) is equal to the second-order energy 
determined by perturbation theory. Any trial function 0 with parameters which 
reduces to vy 1 for some set of parameter values yields an approximate 

energy <% from equation (9.2) which is no less accurate than the second-order 
perturbation value. 
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9.4 Perturbed harmonic oscillator 

As illustrations of the application of perturbation theory we consider two 
examples of a perturbed harmonic oscillator. In the first example, we suppose 
that the potential energy V of the oscillator is 

V = \kx 2 + cx 4 = \ma) 2 x 2 + cx 4 


where c is a small quantity. The units of V are those of tun (energy), while the 
units of x are shown in equation (4.14) to be those of ( h/mco ) 1 / 2 . Accordingly, 
the units of c are those of nroE /t> and we may express c as 

2 3 

„ mar 


where X is dimensionless. The potential energy then takes the form 

2 3 4 

i 9 9 m oj x ^ ^ 

V = \ma) 2 x 2 + X --- (9.42) 

n 

The Hamiltonian operator // (0) for the unperturbed harmonic oscillator is 
given by equation (4.12) and its eigenvalues E ( ^ and eigenfunctions y ,(> J } are 
shown in equations (4.30) and (4.41). The perturbation //' is 


H' = H (l) = X 


m 2 a) 3 x 4 

h 


(9.43) 


Higher-order terms // |2) , ... in the perturbed Hamiltonian operator do 

not appear in this example. 

To find the perturbation corrections to the eigenvalues and eigenfunctions, 
we require the matrix elements (n'\x 4 \n) for the unperturbed harmonic 
oscillator. These matrix elements are given by equations (4.51). The first-order 
correction E (4) to the eigenvalue E n is evaluated using equations (9.24), (9.43), 
and (4.51c) 


E (4) = H (4) n = («.|x 4 |«) = |(n 2 + n + \)Xhet) 


(9.44) 


The second-order correction E (2) is obtained from equations (9.34), (9.43), and 
(4.51) as follows 



E ( 2) 
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A(!) 1 

n n—4,n \ 

2 

A(l) 1 

n n-2,n\ 

2 

i/ (1) 

n n-\-2,n \ 

2 

1 

ii «+4,« 1 

2 

E 

(°) _ A 0 ) 77 

n—4 £ 

'(0) _ AO) 77 

71-2 ^77 11 

<0) — E {0) E 

714-2 ^ 77 c 

,f °) _ jt(0) 

«+4 ^ rc 


l 2 w 4 m 6 

"(«- 4|x» 2 

(n — 2|x 4 |«) 2 

(n + 2|x 4 |«) 2 

<w + 4|x» 2 ’ 

h 2 

(—4/ky) 

(—2 fay) 

2 / 74/4 

4/7474 


= -±( 34« 3 + 51« 2 + 59 « + 21 ) 2 2 / m 4 ( 9 . 45 ) 


The perturbed energy E„ to second order is, then 

E n « 4 °) + 4 1 ) + Ef 


= (n + \)ha) + |(n 2 + « + — |(34« 3 + 51« 2 + 59 n + 21 )A 2 hco 

(9.46) 

In the expression (9.45) for the second-order correction E (2 \ the summation 
on the right-hand side includes all states k other than the state «, but only for 
the states ( n — 4), (n — 2), (n + 2), and (n + 4) are the contributions to the 
summation non-vanishing. For the two lowest values of n, giving E { 2) and e\ 2 \ 
only the two terms k = ( n + 2) and k = (n + 4) should be included in the 
summation. However, the terms for the meaningless values k = {n — 2) and 
k = (n — 4) vanish identically, so that their inclusion in equation (9.45) is 
valid. A similar argument applies to E {2) and Ef\ wherein the term for the 
meaningless value k = (n — 4) is identically zero. Thus, equation (9.46) 
applies to all values of n and the perturbed ground-state energy E 0 , for 
example, is 

Eo ~ - 2 iA 2 )hco 


The evaluation of the first- and second-order corrections to the eigenfunc¬ 
tions is straightforward, but tedious. Consequently, we evaluate here only the 
first-order correction ip^ for the ground state. According to equations (9.33), 
(9.43), and (4.51), this correction term is given by 


V ; o ) = - 


H 


(i) 

20 


M 0) - E\ 


v/°) - 
( 0 ) '2 


H' 


(i) 

40 


17(0)_ AO) 

U-4 




Am 2 a) 3 

h 


< 2 Tl<V> 


2 fuo 


, < 4 |-^ 4 |°) (0) 
f 4 hto ^ 


A 


\61pf + ViipT] 


T" V Jf4 J (9.47) 

If the unperturbed eigenfunctions and 4/4° * as given by equation (4.41) are 
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explicitly introduced, then the perturbed ground-state eigenfunction ip 0 to first 
order is 


'll) ^ 7 /,tO) 1 , (i) _ 

ro ~ Vo 'ro 


nu •) 
7th 


1/4 


\-^(4t+12^-9) 


,-! 2 /2 


(9.48) 


As a second example, we suppose that the potential energy V for the 
perturbed harmonic oscillator is 


V = \kx 2 + cx 3 = \ma) 2 x 2 + X 


T 5 

mar 

h 


1/2 


(9.49) 


where c = X^rrv’a) 5 /h) 1 ' 2 is again a small quantity and X is dimensionless. The 
perturbation H’ for this example is 


H’ = H {1) = X 


T 5 

mar 

h 


1/2 


(9.50) 


The matrix elements ( n'\x 3 \n ) for the unperturbed harmonic oscillator are 
given by equations (4.50). The first-order correction term Ety is obtained by 
substituting equations (9.50) and (4.50e) into (9.24), giving the result 


E& = X 


3 5\ 1/2 
mto 


h 


(«|x 3 |«) = 0 


(9.51) 


Thus, the first-order perturbation to the eigenvalue is zero. The second-order 
term E (2) is evaluated using equations (9.34), (9.50), and (4.50), giving the 
result 

E„ « E (2) 


//' 


(i) 

n—3,n 


X 2 m 3 a) 5 


ft 


12 
n \ 

\H (l) I 2 

\ rL n-\,n\ 

\H^ |2 

\ rL n+\,n\ 

I//T) |2 

\ rl n+'},n\ 



77 *°) _ 17 ( 0 ) 

n -1 c n 

Z 7 <°) _ 77 ( 0 ) 

^h+I 

Z7<°) _ f(0) 

c n+3 c n 


( n- 

■3|x 3 |«) 2 (n 

— l|x 3 |«) 2 

(n + l|x 3 |n) 2 

(n + 3 |x 3 \n) 2 

(- 

-3hto) 

(—hat) 

ha) 

3hco 


= -i(30« 2 + 30« + U)X 2 fuo 


(9.52) 


9.5 Degenerate perturbation theory 

The perturbation method presented in Section 9.3 applies only to non-degen¬ 
erate eigenvalues E^ } of the unperturbed system. When E^ is degenerate, the 
denominators vanish for those terms in equations (9.40) and (9.41) in which 
E^ ] is equal to E^\ making the perturbations to E„ and xp n indeterminate. In 
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this section we modify the perturbation method to allow for degenerate 
eigenvalues. In view of the complexity of this new procedure, we consider only 
the first-order perturbation corrections to the eigenvalues and eigenfunctions. 

The eigenvalues and eigenfunctions for the unperturbed system are given by 
equation (9.18), but now the eigenvalue E {{ ^ is g„-fold degenerate. Accord¬ 
ingly, there are g„ eigenfunctions with the same eigenvalue E ( ®\ For greater 
clarity, we change the notation here and denote the eigenfunctions correspond¬ 
ing to as a = 1, 2, ..., g n . Equation (9.18) is then replaced by the 
equivalent expression 

= 4 0) ve a = \,2, ..., g n (9.53) 

Each of the eigenfunctions ip^a is orthogonal to all the other unperturbed 

eigenfunctions v4°« f° r £ ^ n, but is not necessarily orthogonal to the other 

eigenfunctions for E®\ Any linear combination (j) „„ of the members of the set 

#°> 
t na 

gn 

= a=\,2,...,g„ (9.54) 

P= i 

is also a solution of equation (9.53) with the same eigenvalue E^\ As 
discussed in Section 3.3, the members of the set v4°« may be constructed to be 
orthonormal and we assume that this construction has been carried out, so that 

(v4>/™) = d a p, a, 13 = 1, 2, ..., gn (9.55) 

By suitable choices for the coefficients c a p in equation (9.54), the functions 
(j> na may also be constructed as an orthonormal set 

{(pnMna) = dap, Ct, ft = l, 2, . . . , g n (9.56) 

Substitution of equation (9.54) into (9.56) and application of (9.55) give 

gn 

£ c; y c ay = dap, a, P= 1, 2, ..., g n (9.57) 

y=i 

The Schrodinger equation for the perturbed system is 

Hip na E na ip na , cl 1,2, ..., g n (9.58) 

where the Hamiltonian operator H is given by equation (9.16), E na are the 
eigenvalues for the perturbed system, and ip na are the corresponding eigen¬ 
functions. While the unperturbed eigenvalue E \^ is g„-fold degenerate, the 
perturbation H' in the Hamiltonian operator often splits the eigenvalue Ey 
into g n different values. For this reason, the perturbed eigenvalues E na require 
the additional index a. The perturbation expansions of E na and ip na in powers 
of X are 
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E na = Ef + XE^ a + X 2 E^ a + ■ • •, a = \, 2, g n (9.59) 
na = </C + + ^V„« + ’ ’ ’ > a = 1, 2, . . . , g n (9.60) 

Note that in equation (9.59) the zero-order term is the same for all values of a. 

In the limit A —* 0, the Hamiltonian operator II approaches the unperturbed 
operator // (0) and the perturbed eigenvalue E na approaches the degenerate 
unperturbed eigenvalue E^\ The perturbed eigenfunction ?/>„„ approaches a 
function which satisfies equation (9.53), but this limiting eigenfunction may 
not be any one of the initial functions 0^. In general, this limiting function is 
some linear combination of the initial unperturbed eigenfunctions as 
expressed in equation (9.54). Thus, along with the determination of the first- 
order correction terms E^ a and we must find the set of unperturbed 
eigenfunctions <p \® to which the perturbed eigenfunctions reduce in the limit 
X —> 0. In other words, we need to evaluate the coefficients c a p in the linear 
combinations (9.54) which transform the initial set of unperturbed eigenfunc¬ 
tions 0 ( „ ( « into the ‘correct’ set 0^. Equation (9.60) is then replaced by 

M>na = 4> ( nl + V,+ X 2 ^ + ■■■, a = 1, 2, . . . , g n (9.61) 

The first-order equations (9.22) and (9.24) apply here provided the additional 
index a and the ‘correct’ unperturbed eigenfunctions are used 

(H <0) - EfX = <«'" - (9.62) 


Ei = ) (9.63) 


However, equation (9.63) for the first-order corrections to the eigenvalues 
cannot be used directly at this point because the functions are not known. 

To find E\l\ we multiply equation (9.62) by y/^:, , the complex conjugate of 
one of the initial unperturbed eigenfunctions belonging to the degenerate 
eigenvalue E^\ and integrate over all space to obtain 


- ET W>) = -<(/-<„><» - 4‘>l«> 

Applying the hermitian property of // (0) , we see that the left-hand side 
vanishes. Substitution of the expansion (9.54) for 0^ using y as the dummy 
expansion index gives 


y Carin'" - ^l<!) 


7=1 


0 , 


a, p = 1, 2, ..., g n 


If we introduce the abbreviation 

A 7 = 1,2,..., g, 

and apply the orthonormality condition (9.55), this equation takes the form 
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= a,/! =1,2 (9.64) 

y= i 

Note that the integrals are evaluated with the known initial set of 

unperturbed eigenfunctions, in contrast to the integrals in equation (9.63), 
which require the unknown functions . For a given eigenvalue E^ a , the 
expression (9.64) is a set of g n linear homogeneous simultaneous equations, 
one for each value of/3 (J3 = 1,2,..., g n ) 

/3 = 1 : CaiiH^I nl — E^) + C,a H ( J n2 + Ca3^ n } n3 + • ■ • + c agn H ( n \ ngn = 0 


P ~ C «l H ( ' n 2, n \ + C a2(H ( ' n 2 n2 E^) + C„3 H X n l n3 + ■ ■ ■ + C agn H^ ngn — 0 


Ul) 


P — gn- c a\H ( ' ngn )A + C a 2H ng „ n2 


+ Cai H 


CD 

ng„,n3 


+ ’ ’ ’ + c ag„(H\,g„,ng„ na ) 


= 0 


Equation (9.64) has the form of (9.13) with the coefficients c ay correspond¬ 
ing to the unknown quantities x, and the terms ( H ( 'J, ny — E^dpy) correspond¬ 
ing to the coefficients ay,. Thus, a non-trivial solution for the g„ coefficients 
c ay (y = 1,2,..., g„) exists only if the determinant with elements ( H (l J, ny — 
E^d^) vanishes 


T[(l) _ C(l) 

n n 1, n 1 ^ na 

H (l) 

n n\,n2 

H (l) 

nl,ng„ 



H (X) 

n n2,n\ 

T[(i) __ 77(1) . , 

11 n2,n2 ^ na 

n2,ng„ 

= 0 

(9.65) 

77 (1) 

n ng„,n 1 

H (l) 7 

ng„,n2 

//(l) _ Ft 1 ) 

ng„,ng„ na 




Only for some values of the first-order correction term E^ is the secular 
equation (9.65) satisfied. This secular equation is of degree g„ in E ( ^ a , giving 
g n roots 


17(1) 77(1) 

1’ tj n2' 


E’(l) 

n ng„ 


all of which are real because // n 1 is hermitian. The perturbed eigenvalues to 
first order are, then 

E nl =E^ + XE^ 


E 


ngn 



If the g n roots are all different, then in first order the g„-fold degenerate 
unperturbed eigenvalue E ( ;J 0) is split into g„ different perturbed eigenvalues. In 
this case, the degeneracy is removed in first order by the perturbation. We 
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assume in the continuing presentation that all the roots are indeed different 
from each other. 


‘ Correct ’ zero-order eigenfunctions 

The determination of the coefficients c ay is not necessary for finding the first- 
order perturbation corrections to the eigenvalues, but is required to obtain the 
‘correct’ zero-order eigenfunctions and their first-order corrections. The coeffi¬ 
cients c ay for each value of a (a = 1 , 2 , ..., g„) are obtained by substituting 
the value found for E\^ from the secular equation (9.65) into the set of 
simultaneous equations (9.64) and solving for the coefficients c a 2 , ..., c agll in 
terms of c a The normalization condition (9.57) is then used to determine c a \. 
This procedure uniquely determines the complete set of coefficients c ay (a, 
y = 1 , 2 ,..., g„) because we have assumed that all the roots £'■ ',) are different. 

If by accident or by clever choice, the initial set of unperturbed eigenfunc¬ 
tions V ( „°d is actually the ‘correct’ set, i.e., if in the limit 2-^0 the perturbed 
eigenfunction ip na reduces to r/ 2 °d f° r a il values of a, then the coefficients c ay 
are given by c ay = <\ iy and the secular determinant is diagonal 

ttA) _ 77 O) 0 ... 0 

n n\,n\ c na u u 

0 A'L-bJi ••• 0 - 0 

0 0 ••• 

The first-order corrections to the eigenvalues are then given by 

4’d = a = 1,2, ...,g n (9.66) 

It is obviously a great advantage to select the ‘correct’ set of unperturbed 
eigenfunctions as the initial set, so that the simpler equation (9.66) may be 
used. A general procedure for achieving this goal is to find a hermitian operator 
A that commutes with both il {{)] and //’ 11 and has eigenfunctions with non¬ 
degenerate eigenvalues u a _, so that 

[A, H (0) ] = [A, H (l) ] = 0 (9.67) 

and 


AXa — ft-aXa 

Since A and // (0) commute, they have simultaneous eigenfunctions. Therefore, 
we may select xi, X 2 , ■ ■ ■, Xg„ as ^e initial set of unperturbed eigenfunctions 

f { na=Xa, a = 1, 2, . . . , g„ 

We next form the integral (xp\[A, II {]} ]\x<t) (ft f a), which of course vanishes 
according to equation (9.67), 
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(XpU, H W ]\ Xa ) = (xp\AHV\ Xa ) - ( Xfj \H^A\ Xa ) 

= (A X p\H {l) \ Xa ) ~ t*a(Xp\H {l) \ Xa ) 

= (Mf! - ^a){^\H (l) \lpZ) 

= (jXp - pa)H ( H na 

= 0 

Since pp f u a , the off-diagonal elements //*',! na equal zero and the set of 
functions V ( ,°d = Xa ' s the ‘correct’ set. The parity operator FI discussed in 
Section 3.8 can often be used in this context for selecting ‘correct’ unperturbed 
eigenfunctions. 


First-order corrections to the eigenfunctions 

To obtain the first-order corrections to the eigenfunctions tp na , we multiply 
equation (9.62) by for k f n and integrate over all space 

(<J|H <0) - Civ© = -(<il» <,) l«> + 

Applying the hermitian property of // (0) and noting that ij/fj, is orthogonal to 
all eigenfunctions belonging to the eigenvalue E®\ we have 

(E m _ <<>m 

We next expand the first-order correction in terms of the complete set of 
unperturbed eigenfunctions 


*2 = E E CtnaWfy (9-69) 

K¥=n) r =1 


where the terms with j = n are omitted for the same reason that they are 
omitted in equation (9.30). Substitution of equations (9.54) and (9.69) into 
(9.68) gives 

(£f - MI") E E “•vjMI l<’> = - E c «y 

ji+n) y= 1 7=1 


In view of the orthonormality relations, the summation on the left-hand side 
may be simplified as follows 

Si gj 

a naJy{^ktjVpSv) = a na,jy&kj&fiy = ^na,kft 

K¥=n) 7=1 K+n) 7=1 


Therefore, we have 
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@lia.kft 


' Ca Y^k},ny 

(E? - 4 0> ) 


(9.70) 


The eigenfunctions xp na for the perturbed system to first order are obtained by 
combining equations (9.61), (9.69), and (9.70) 

gn 


k&n) 


fna = (t>Z 


c H {1) 

Ca Y rl k/t,ny 


T 

g k 

S (E?-E^) 


V 1 ) 

r kjt 


(9.71) 


Example: hydrogen atom in an electric field 

As an illustration of the application of degenerate perturbation theory, we 
consider the influence, known as the Stark effect, of an externally applied 
electric field W on the energy levels of a hydrogen atom. The unperturbed 
Hamiltonian operator /7 (0) for the hydrogen atom is given by equation (6.14), 
and its eigenfunctions and eigenvalues are given by equations (6.56) and 
(6.57), respectively. In this example, we label the eigenfunctions and eigenva¬ 
lues of // |(,) with an index starting at 1 rather than at 0 to correspond to the 
principal quantum number n. The perturbation //' is the potential energy for 
the interaction between the atomic electron with charge — e and an electric field 
W directed along the positive z-axis 

H' = H (l) = e&z = e<S rcos0 (9.72) 

If spin effects are neglected, the ground-state unperturbed energy level 
is non-degenerate and its first-order perturbation correction E \ 11 is given by 
equation (9.24) as 

E { \ ] = et? (ls|z| Is) = 0 

This integral vanishes because the unperturbed ground state of the hydrogen 
atom, the Is state, has even parity and z has odd parity. 

The next lowest unperturbed energy level Ef*, however, is four-fold degen¬ 
erate and, consequently, degenerate perturbation theory must be used to 
determine its perturbation corrections. For simplicity of notation, in the 
quantities yff, and //^ n/ , we drop the index n, which has the value 
n = 2 throughout. As the initial set of eigenfunctions for the unperturbed 
system, we select the 2s, 2p 0 , 2pi, and 2p^i atomic orbitals as given by 
equations (6.59) and (6.60), so that 
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ip\ 0) = |2s), ipf = |2p 0 > 

xpf = |2p!>, = \ 2 P-i) 

The ‘correct’ set of unperturbed eigenfunction 0^ O) are, then 

4 

<t> { a =^2 c apipp ) = c«i|2s) + c a2 |2p 0 ) + c a3 |2pi) + c a4 |2p^i), 
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(9.73) 


8=1 


a = 1, 2, 3, 4 
The matrix elements in this example are 

H { a(! = e^i'PaM'pf) = e^{xp^\r cos d\xpf) 


(9.74) 


= e<% 


*2 Jt 

*JT 

0 . 

0 . 


r cos 6 f ' z s i n $ dr d 6 dtp 


(9.75) 


These matrix elements vanish unless Am = 0 and Al = 1. Thus, only the 
matrix element which equals is non-zero. 

To evaluate the matrix element H\\\ we substitute the 2s wave function from 
equation (6.59) and the 2p 0 wave function from equation (6.60a) into (9.75) 

i>2?r 


= H { 2 \ = eW[ji(2a { tf] 


,4n — 1 


0 


1 - —)e- r/flo dr 

2a 0 J 


cos 2 0sin0d0 


o 


dtp 


o 


= —3ei§fao 

where equations (A.26) and (A.28) are used. 
The secular determinant (9.65) is 


- E l ] 

-3 e%a 0 -E {1) 


—le&cio 

o; 

2 


0 

0 


0 

0 


0 

0 

—E 1 


-(i) 


0 

0 

0 

-E\ 


.(1) 


= 0 


which expands to 


[( 4 1) ) 2 - OeSa^JE^f = 0 


The four roots are E^’ = —3 ecoci^, 3eE:a () , 0, 0, so that to first order the 
perturbed energy levels are 


-e’ 2 

E21 = - 3 ec ao. 


E 23 — 


—e 


'2 


E 22 ~ 


— e 


'2 


-(- 3<"' ti{\. 


E 24 — 


8ao 

—e' 2 


(9.76) 


8n 0 ' 8<7 o 

The four linear homogeneous simultaneous equations (9.64) are 
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~CalE £> + c a2 H^_ = 0 

C«1 #12 - C «2 4 1} = 0 

“ (9.77) 

= 0 

-Cc^ = 0 

To find the ‘correct’ set of unperturbed eigenfunctions <p^\ we substitute first 
4 ' > = — 3e^ao, then successively = 3eZ?ao, 0 , 0 into the set of equations 


(9.77). The results are as follows 


for 4 ° = #12 = -3 e^a 0 : 

c l — C 2 ; c 3 — c 4 — 0 

for = -# 1 ? = 3 e^a 0 : 

c\ = — c 2 ; C 3 = C 4 = 0 

for 4 * = 0: 

c i = c 2 = 0 ; C 3 and C 4 undetermined 


Thus, the ‘correct’ unperturbed eigenfunctions are 

0 (O) = 2 -i/ 2 (| 2s ) + | 2 p 0 » 

<t>? = 2- 1/2 (|2s> - |2p 0 » 

03 O) = |2pi) 

04 O) = |2p_l> 

The factor 2 -1 / 2 is needed to normalize the ‘correct’ eigenfunctions. 


9.6 Ground state of the helium atom 


In this section we examine the ground-state energy of the helium atom by 
means of both perturbation theory and the variation method. We may then 
compare the accuracy of the two procedures. 

The potential energy V for a system consisting of two electrons, each with 
mass m c and charge —e, and a nucleus with atomic number Z and charge +Ze 
is 


Ze ' 2 Ze' 2 e' 2 

V = -b — 

r\ r 2 r u 

where r\ and r 2 are the distances of electrons 1 and 2 from the nucleus, r l2 is 
the distance between the two electrons, and e' = e for CGS units or 
e' = e/(4rrco ) l/2 for SI units. If we assume that the nucleus is fixed in space, 
then the Hamiltonian operator for the two electrons is 


H = - 


2 m e 


(V 2 + V 2 ) 


Ze ' 2 


Ze' 2 e' 2 

-1- 

ri ri 2 


(9.79) 
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The operator H applies to He for Z = 2, Li + for Z = 3, Be 2+ for Z = 4, and 
so forth. 


Perturbation theory treatment 

We regard the term e' 2 /r u in the Hamiltonian operator as a perturbation, so 
that 

e' 2 

H' = H (l) = — (9.80) 

r\i 

In reality, this term is not small in comparison with the other terms so we 
should not expect the perturbation technique to give accurate results. With this 
choice for the perturbation, the Schrodinger equation for the unperturbed 
Hamiltonian operator may be solved exactly. 

The unperturbed Hamiltonian operator is the sum of two hydrogen-like 
Hamiltonian operators, one for each electron 

H (0) = H[ 0) + Hf 

where 


fjW) = _ . fl T72 

1 - n 

Ze' 2 


M 0) = - 


2m, 

fi 2 


V 2 - 


2m e *■ r 2 

If the unperturbed wave function y/ (>> is written as the product 

V ,(0) ( r l» r 2) = y>f\r l)V’2 0) (l*2) 

and the unperturbed energy E (i}> is written as the sum 

E (0) = E W) + E (0) 

then the Schrodinger equation for the two-electron unperturbed system 
tf(°y o) (ri, r 2 ) = Eip (0 \r u r 2 ) 


separates into two independent equations, 

= i = 1,2 

which are identical except that one refers to electron 1 and the other to electron 
2. The solutions are those of the hydrogen-like atom, as discussed in Chapter 6. 
The ground-state energy for the unperturbed two-electron system is, then, twice 
the ground-state energy of a hydrogen-like atom 


£ (0) = -2 


Z 2 e' 2 

2a 0 


Z 2 e' 2 

ao 


( 9 - 81 ) 


The ground-state wave function for the unperturbed two-electron system is the 
product of two Is hydrogen-like atomic orbitals 
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Table 9.1. Ground-state energy of the helium 
atom 


Method 

Energy (eV) 

% error 

Exact 

-79.0 


Perturbation theory: 

£(°) 

-108.8 

-37.7 

E (0) + ES 0 

-74.8 

+5.3 

Variation theorem (Z) 

-77.5 

+ 1.9 


f (0) (r u r 2 ) = 

jt \ao 

where we have defined 


1 (Z 


— p-Zn/ao -Zr 2 /a 0 1 p ~(Pi+P2>/2 


1 (Z 


pi = 


2 Zrj 

a 0 


Jt \ao 


i= 1,2 


(9.82) 


(9.83) 


The first-order perturbation correction E n> to the ground-state energy is 
obtained by evaluating equation (9.24) with (9.80) as the perturbation and 
(9.82) as the unperturbed eigenfunction 


(0) 


E (l) = ( xj) 


where Pn = \pi — P\ \ and where 


e' 2 

^A= — { 

y o) 

e' 2 

n 2 

/ flo 

\ 

Pn 


, m \ Ze' 2 

lp {0) ) = 

/ 2 5 jr 2 ao 


(9.84) 


/ = 


p-(Pl+P2) 

C .2 „2 


Pn 


p { P 2 sinsin02 dpi d0i dcpi dp 2 d0 2 dcp 2 (9.85) 


This six-fold integral I is evaluated in Appendix J and is equal to 20jt 2 . Thus, 
we have 


E A) = 5Zg '~ = _ E (0) 


8ao 


8 Z 


(9.86) 


The ground-state energy of the perturbed system to first order is, then 

E = E (0) + £ n) = - (Z 2 - ^ — (9.87) 

V 8 / «o 

Numerical values of E W) and E {{)) + E {l ) for the helium atom (Z = 2) are 
given in Table 9.1 along with the exact value. The unperturbed energy value 
£ (0) has a 37.7% error when compared with the exact value. This large 
inaccuracy is expected because the perturbation H' in equation (9.80) is not 
small. When the first-order perturbation correction is included, the calculated 
energy has a 5.3% error, which is still large. 
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Variation method treatment 

As a normalized trial function 0 for the determination of the ground-state 
energy by the variation method, we select the unperturbed eigenfunction 
V (0) (n> r 2 ) of the perturbation treatment, except that we replace the atomic 
number Z by a parameter Z' 

0 = 0102 


01 


1 

Tf 1 / 2 



3/2 

e -ZVi/ao 


(9.88) 


02 


1 



3/2 

Q -Z'r 2 /ao 


The parameter Z' is an effective atomic number whose value is determined by 
the minimization of <% in equation (9.2). Since the hydrogen-like wave func¬ 
tions 0i and 02 are normalized, we have 

(0i|0i) = {02102) = 1 (9.89) 


The quantity t% is obtained by combining equations (9.2), (9.79), (9.88), and 
(9.89) to give 


% 



h 2 2 Ze' 2 

2^ v '-vr 


0i 


+ ( 02 


h 2 „ Ze' 2 
— V^- — 
2/« e 7*2 


02 


+ 




(9.90) 


Note that while the trial function 0 = 0102 depends on the parameter Z', the 
Hamiltonian operator contains the true atomic number Z. Therefore, we rewrite 
equation (9.90) in the form 


% 



2 m e 


>„'2 


7*1 


0i ) + ( 0i 


(Z' - Z)e' 2 
7*1 


01 


+ ( 02 


7*2 


2 W e 


02 ) + ( 02 


(Z' - Z)e' 2 
7*2 


02 


+ 




(9.91) 


The first term on the right-hand side is just the energy of a hydrogen-like atom 
with nuclear charge Z', namely, —Z' 2 e' 2 /2a^. The third term has the same 
value as the first. The second term is evaluated as follows 
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<P\ 


(Z' - Z)e' 2 

n 


0! = (Z' - Z)e' 2 


1 (Z'\ 


- 


= (Z' - Z)e — 
«o 


Ji \a 0 J 
Z' 


-1 2 Z' 


'' 1 /«°4 Jrr 2 ^ 


where equations (A.26) and (A.28) have been used. The fourth term equals the 
second. The fifth term is identical to E (l) of the perturbation treatment given by 
equation (9.86) except that Z is replaced by Z' and therefore this term equals 
5Z'e' 2 /Sa 0 . Thus, the quantity W in equation (9.91) is 


(3=2 


Z' 2 e' 2 


2 a 0 


+ 2 


Z'(Z' - Z)e 




a 0 


+ 


5Z'e' 2 


j2 


= [Z' 2 -2(Z-A)Z'] — 


8uq 


fl 0 
(9.92) 


We next minimize <% with respect to the parameter Z' 

— = 2[Z'-(Z-^)1—-0 
dZ' " 4Z (Z 16>J a 0 U 


so that 


7' — 7 T 

A — A 16 


Substituting this result into equation (9.92) gives 

^ = -(Z-±f—^E 0 
a 0 

as an upper bound for the ground-state energy E 0 . 

When applied to the helium atom (Z = 2), this upper bound is 

. , , . „ y2 

gr = -| 


(9.93) 


27 

16 


2 ,2 

e e 

— = -2.85 — 
a o no 


(9.94) 


The numerical value of W is listed in Table 9.1. The simple variation function 
(9.88) gives an upper bound to the energy with a 1.9% error in comparison 
with the exact value. Thus, the variation theorem leads to a more accurate 
result than the perturbation treatment. Moreover, a more complex trial function 
with more parameters should be expected to give an even more accurate 
estimate. 


Problems 


9.1 The Hamiltonian operator for a hydrogen atom in a uniform external electric 
field E along the z-coordinatc axis is 




r 
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Use the variation trial function 0 = i/q s (1 +Az), where 2 is the variation 
parameter, to estimate the ground-state energy for this system. 

9.2 Apply the gaussian function 

0 = e - lr2/a2 ° 

where 2 is a parameter, as the variation trial function to estimate the energy of 
the ground state of the hydrogen atom. What is the percent error? 

9.3 Apply the variation trial function <p(x) — x(a — x)(a — 2x) to estimate the energy 

of a particle in a box with V(x) — 0 for 0 x a, V(x) — oo for x < 0, x > a. 

To which energy level does this estimate apply? 

9.4 Consider a particle in a one-dimensional potential well such that 

V(x) — (bfr/ma 4 )x(x — a), 0 =£ x =£ a 

— oo, x <0, x> a 

where b is a dimensionless parameter. Using the particle in a box with V(x) — 0 
for 0 ^ x ^ a, V(x) — oo for x < 0, x > a as the unperturbed system, calculate 
the first-order perturbation correction to the energy levels. (See Appendix A for 
the evaluation of the resulting integrals.) 

9.5 Consider a particle in a one-dimensional potential well such that 

V(x) — (bh 2 /ma 3 )x, 0 =£ x a 

— oo, x< 0, x > a 

where b is a dimensionless parameter. Using the particle in a box with V(x) — 0 
for 0 ^ x ^ a, V(x) — oo for x < 0, x > a as the unperturbed system, calculate 
the first-order perturbation correction to the energy levels. (See Appendix A for 
the evaluation of the resulting integral.) 

9.6 Calculate the second-order perturbation correction to the ground-state energy for 
the system in problem 9.5. (Use integration by parts and see Appendix A for the 
evaluation of the resulting integral.) 

9.7 Apply the linear variation function 

<fi — c\(2/a )'/ 2 sin(jrx/a) + C 2 ( 2 /fl) 1/,2 sin(2jrx/a) 

for 0 ^ x ^ a to the system in problem 9.5. Set the parameter b in the potential 
equal to it 1 / 8. Solve the secular equation to obtain estimates for the energies E\ 
and E 2 of the ground state and first-excited state. Compare this estimate for E\ 
with the ground-state energies obtained by first-order and second-order perturba¬ 
tion theory. Then determine the variation functions (pi and (pj that correspond to 
E\ and E 2 . 

9.8 Consider a particle in a one-dimensional champagne bottle 2 for which 

V(x) — (jr 2 h 2 /8ma 2 )sm(jrx/a), 0 x a 

— oo, x < 0, x > a 

2 G. R. Miller (1979) J. Chem. Educ. 56, 709. 
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Calculate the first-order perturbation correction to the ground-state energy level 
using the particle in a box with V(x) — 0 for 0 =£ x a, V(x) = oo for x < 0, 
x> a as the unperturbed system. Then calculate the first-order perturbation 
correction to the ground-state wave function, terminating the expansion after the 
term k — 5. (See Appendix A for trigonometric identities and integrals.) 

9.9 Using first-order perturbation theory, determine the ground-state energy of a 
hydrogen atom in which the nucleus is not regarded as a point charge. Instead, 
regard the nucleus as a sphere of radius b throughout which the charge +e is 
evenly distributed. The potential of interaction between the nucleus and the 
electron is 


V(f) 




0 r =£ b 


—e 


r> b 


The unperturbed system is, of course, the hydrogen atom with a point nucleus. 
(Inside the nuclear sphere, the exponential 

e - 2 r/ ao = 1 _ (2r/flo) + (2r 2 /al) - 


may be approximated by unity because r is very small in that region.) 

9.10 Using first-order perturbation theory, show that the spin-orbit interaction energy 
for a hydrogen atom is given by 


W\E^\ 


w(/ + 2 ){l H“ 1) 


4«X 0) I 


nl(l + 2 ) 


for j = l + \, l ^0 
for j — l — l yk 0 


The Hamiltonian operator is given in equation (7.33), where H 0 represents the 
unperturbed system and H so is the perturbation. Use equations (6.74) and (6.78) 
to evaluate the expectation value of £(r). 
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Molecular structure 


A molecule is composed of positively charged nuclei surrounded by electrons. 
The stability of a molecule is due to a balance among the mutual repulsions of 
nuclear pairs, attractions of nuclear-electron pairs, and repulsions of electron 
pairs as modified by the interactions of their spins. Both the nuclei and the 
electrons are in constant motion relative to the center of mass of the molecule. 
However, the nuclear masses are much greater than the electronic mass and, as 
a result, the nuclei move much more slowly than the electrons. Thus, the basic 
molecular structure is a stable framework of nuclei undergoing rotational and 
vibrational motions surrounded by a cloud of electrons described by the 
electronic probability density. 

In this chapter we present in detail the separation of the nuclear and 
electronic motions, the nuclear motion within a molecule, and the coupling 
between nuclear and electronic motion. 


10.1 Nuclear structure and motion 

We consider a molecule with Q nuclei, each with atomic number Z a and mass 
M a (a = 1,2, , Q), and N electrons, each of mass m e . We denote by Q the 

set of all nuclear coordinates and by r the set of all electronic coordinates. The 
positions of the nuclei and electrons are specified relative to an external set of 
coordinate axes which are fixed in space. 

The Hamiltonian operator H for this system of Q + N particles may be 
written in the form 


H = Tq + V Q + H e 

where Tq is the kinetic energy operator for the nuclei 


( 10 . 1 ) 
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Q 

= -h 2 ^ 


^2M a 

a— 1 “ 


( 10 . 2 ) 


Vq is the potential energy of interaction between nuclear pairs 

£ W- 

a <p=\ r *e 


(10.3) 


and H e is the electronic Hamiltonian operator 


H e = - 


2 m e 


N 


E 

(=i 


Q N 

EE- 

a=l i= 1 




/2 


« /2 

\ e 




(10.4) 


t<J= i y 


The symbols V 2 and V 2 are, respectively, the laplacian operators for a single 
nucleus and a single electron. The variable r a p is the distance between nuclei a 
and (i, r ai the distance between nucleus a and electron i, and r,y the distance 
between electrons i and j. The summations are taken over each pair of 
particles. The quantity e' is equal to the magnitude of the electronic charge e in 
CGS units and to e/(4n£o) l /2 in SI units, where £o is the permittivity of free 
space. 

The Schrodinger equation for the molecule is 


HW(r, Q) = W(r, Q) 


(10-5) 


where ^(r, Q) is an eigenfunction and E the corresponding eigenvalue. The 
differential equation (10.5) cannot be solved as it stands because there are too 
many variables. However, approximate, but very accurate, solutions may be 
found if the equation is simplified by recognizing that the nuclei and the 
electrons differ greatly in mass and, as a result, differ greatly in their relative 
speeds of motion. 


Born—Oppenheimer approximation 

The simplest approximate method for solving the Schrodinger equation (10.5) 
uses the so-called Born-Oppenheimer approximation. This method is a two- 
step process. The first step is to recognize that the nuclei are much heavier than 
an electron and, consequently, move very slowly in comparison with the 
electronic motion. Thus, the electronic part of the Schrodinger equation may 
be solved under the condition that the nuclei are motionless. The resulting 
electronic energy may then be determined for many different fixed nuclear 
configurations. In the second step, the nuclear part of the Schrodinger equation 
is solved by regarding the motion of the nuclei as taking place in the average 
potential field created by the fast-moving electrons. 

In the first step of the Born-Oppenheimer approximation, the nuclei are all 
held at fixed equilibrium positions. Thus, the coordinates Q do not change with 
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time and the kinetic energy operator fg in equation (10.2) vanishes. The 
Schrodinger equation (10.5) under this condition becomes 

(H e + V g )rp K (r, Q) = e K (Q)ip K (r, Q) (10.6) 

where the coordinates Q are no longer variables, but rather are constant 
parameters. For each electronic state k, the electronic energy e K (Q) of the 
molecule and the eigenfunction ip K (r, Q) depend parametrically on the fixed 
values of the coordinates Q. The nuclear-nuclear interaction potential Vg is 
now a constant and its value is included in £ K (Q). 

We assume in this section and in Section 10.2 that equation (10.6) has been 
solved and that the eigenfunctions xp K { r, Q) and eigenvalues £ K (Q) are known 
for any arbitrary set of values for the parameters Q. Further, we assume that the 
eigenfunctions form a complete orthonormal set, so that 

ip*(r, Q)(M r , Q) dr = d,a (10.7) 

In the second step of the Born-Oppenheimer approximation, the energy 
£ k (Q) is used as a potential energy function to treat the nuclear motion. In this 
case, equation (10.5) becomes 

[T q + £ k (Q)]Zk V (Q) = E kv x K v(Q) (10.8) 

where the nuclear wave function Xkv(Q) depends on the nuclear coordinates Q 
and on the electronic state k. Each electronic state k gives rise to a series of 
nuclear states, indexed by v. Thus, for each electronic state k, the eigenfunc¬ 
tions of [Tq + e K (Q)] are Xkv(Q) with eigenvalues E KV . In practice, the nuclear 
states are differentiated by several quantum numbers; the index v represents, 
then, a set of these quantum numbers. In the solution of the differential 
equation (10.8), the nuclear coordinates Q in £ K (Q) are treated as variables. 
The nuclear energy E KV , of course, does not depend on any parameters. Most 
applications of equation (10.8) are to molecules in their electronic ground 
states (k = 0). 

In the original mathematical treatment 1 of nuclear and electronic motion, M. 
Born and J. R. Oppenheimer (1927) applied perturbation theory to equation 
(10.5) using the kinetic energy operator Tq for the nuclei as the perturbation. 
The proper choice for the expansion parameter is 2 = (m e /M) l i 4 , where M is 
the mean nuclear mass 



When terms up to 2 2 are retained, the exact total energy of the molecule is 

1 M. Bom and J. R. Oppenheimer (1927) Ann. Physik 84, 457. 
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approximated by the energy E KV of equation (10.8) and the eigenfunction 
H ; (Q, r) is approximated by the product 

^(Q, r) « Xkv(Q)Vk(Q, r) (10.9) 

Perturbation terms in the Hamiltonian operator up to A 4 still lead to the 
uncoupling of the nuclear and electronic motions, but change the form of the 
electronic potential energy function in the equation for the nuclear motion. 
Rather than present the details of the Bom-Oppenheimer perturbation expan¬ 
sion, we follow instead the equivalent, but more elegant procedure 2 of M. Born 
and K. Huang (1954). 


Born—Huang treatment 

Under the assumption that the Schrodinger equation (10.6) has been solved for 
the complete set of orthonormal eigenfunctions tp K (r, Q), we may expand the 
eigenfunction *P(r, Q) of equation (10.5) in terms of tp K ( r, Q) 

^(r, Q) = ]T;a(Q)<Mr, Q) (10.10) 

i 

where y;(Q) are the expansion coefficients. Substitution of equation (10.10) 
into (10.5) using (10.1) gives 


+V Q + H e - E)xMWi( r, Q) = 0 (10.11) 

A 

where the operators have been placed inside the summation. Since the operator 
H e commutes with the function ~/j(Q), we may substitute equation (10.6) into 
(10.11) to obtain 


YA-TQ + U.(Q) - E] Xx (Qm(r, Q) = 0 (10.12) 


We next multiply equation (10.12) by Q) and integrate over the set of 
electronic coordinates r, giving 


E 


rjr- Q)r e ta(Q)fi(r, Q)]dr + [c,(Q) - i] X; (Ql = 0 (10.13) 


where we have used the orthonormal property (equation (10.7)). The operator 
Tq acts on both functions in the product yUQp/’Ur, Q) and involves the 
second derivative with respect to the nuclear coordinates Q. To expand the 
expression T Q [xAQ)xpi{r, Q)], we note that 


2 M. Bom and K. Huang (1954) Dynamical Theoiy of Ciystal Lattices (Oxford University Press, Oxford), 
pp. 406-7. 
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VaXV = yV a% + alp 


= ^\X + XK*P + 2V aX ' VaV 


Therefore, we obtain 


Q i 

TglXxi Q)^(r, Q)] = -h 2 ^_V 2 a fc(Q)^(r, Q)] 

a= 1 a 


= ^(r, Q)T^a(Q) + Xj.(Q)TQ X Pl( r i Q) 

Q i 

- h 1 Y1 XT V “*AQ)' V «^( r ’ Q) 


M a 

a— 1 


Substitution of equation (10.14) into (10.13) yields 

[Tq + e, c (Q) — /qto^Q) + 'y^(c K i + A, C ;J^(Q) = 0 

A 

where the coefficients c K ;,(Q) and the operators A,,; : are defined by 
c,a(Q) = v£( r * Q)t Q ^ k {r, Q)dr 


(10.14) 

(10.15) 

(10.16) 


Q | * 

A k a = -^ 2 ^— ip*(r, Q)V a ^(r, Q)dr• V a (10.17) 

, iKZ /7 

a—1 17 

and equation (10.7) has been used. Since we have assumed that the electronic 
eigenfunctions ip K {r, Q) are known for all values of the parameters Q, the 
coefficients c:y;(Q) and the operators A,.;_ may be determined. The set of 
coupled equations (10.15) for the functions Xk(Q) is exact. 

The integral / contained in the operator A KK is 

/ = ip*(r, Q)V a t/; K (r, Q)dr 

For stationary states, the eigenfunctions ip K {r, Q) may be chosen to be real 
functions, so that this integral can also be written as 

I = \y a [V/c(r, Q)] 2 dr 

According to equation (10.7), the integral / vanishes and, therefore, we have 
Kk = o. 

We now write equation (10.15) as 

[Tq + U K ( Q) - E] Xk ( Q) + ]T (c K , + KMQ) = 0 (10.18) 

A(^K-) 


where U K (Q) is defined by 
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U k (Q) = £ K (Q) + c kk (Q ) 


(10.19) 


The first term on the left-hand side of equation (10.18) has the form of a 
Schrodinger equation for nuclear motion, so that we may identify the expansion 
coefficient % K (Q) as a nuclear wave function for the electronic state k. The 
second term couples the influence of all the other electronic states to the 
nuclear motion for a molecule in the electronic state k. 

If the coefficients c KK (Q) and c,a(Q) and the operators are sufficiently 
small, the summation on the left-hand side of equation (10.18) and c KK ( Q) in 
(10.19) may be neglected, giving a zeroth-order equation for the nuclear 
motion 


[Tq + £ k (Q) - E ( X(Q) = 0 ( 10 . 20 ) 

where ^(Q) and are the zeroth-order approximations to the nuclear wave 
functions and energy levels. The index v represents a set of quantum numbers 
which determine the nuclear state. The neglect of these coefficients and 
operators is the Born-Oppenheimer approximation and equation (10.20) is 
identical to (10.8). Furthermore, the molecular wave function v F(r, Q) in 
equation (10.10) reduces to the product of a nuclear and an electronic wave 
function as shown in equation (10.9). 

When the coupling coefficients c:y;_ for k ^ X and the coupling operators 
are neglected, but the coefficient c KK ( Q) is retained, equation (10.18) becomes 

[To + E4(Q) - Ei'J]xi'J(Q) = 0 (10.21) 

where XkJ(Q) an d E ( JJ are the first-order approximations to the nuclear wave 
functions and energy levels. Since the term c KK (Q) is added to £, C (Q) in this 
approximation, equation (10.21) is different from (10.20) and, therefore, 
zLV(Q) an d E^J differ from ^(Q) and E$. In this first-order approximation, 
the molecular wave function flflr, Q) in equation (10.10) also takes the form of 
(10.9). The factor describes the nuclear motion, which takes place in a 

potential field U K ( Q) determined by the electrons moving as though the nuclei 
were fixed in their instantaneous positions. Thus, the electronic state of the 
molecule changes in a continuous manner as the nuclei move slowly in 
comparison with the electronic motion. In this situation, the electrons are said 
to follow the nuclei adiabatically and this first-order approximation is known 
as the adiabatic approximation. This adiabatic feature does not occur in 
higher-order approximations, in which coupling terms appear. 

An analysis using perturbation theory shows that the influence of the 
coupling terms with cyflQ) and A,j_ is small when the electronic energy levels 
£ k (Q) and s;(Q) are not closely spaced. Since the ground-state electronic 
energy of a molecule is usually widely separated from the first-excited 



10.2 Nuclear motion in diatomic molecules 


269 


electronic energy level, the Born-Oppenheimer approximation and especially 
the adiabatic approximation are quite accurate for the electronic ground state. 
The influence of the coupling terms for the first few excited electronic energy 
levels may then be calculated using perturbation theory. 


10.2 Nuclear motion in diatomic molecules 

The application of the Born-Oppenheimer and the adiabatic approximations to 
separate nuclear and electronic motions is best illustrated by treating the 
simplest example, a diatomic molecule in its electronic ground state. The 
diatomic molecule is sufficiently simple that we can also introduce center-of- 
mass coordinates and show explicitly how the translational motion of the 
molecule as a whole is separated from the internal motion of the nuclei and 
electrons. 


Center-of-mass coordinates 

The total number of spatial coordinates for a molecule with £2 nuclei and N 
electrons is 3(£2 + N ), because each particle requires three cartesian coordi¬ 
nates to specify its location. However, if the motion of each particle is referred 
to the center of mass of the molecule rather than to the external spaced-fixed 
coordinate axes, then the three translational coordinates that specify the 
location of the center of mass relative to the external axes may be separated out 
and eliminated from consideration. For a diatomic molecule (£2 = 2) we are 
left with only three relative nuclear coordinates and with 3 N relative electronic 
coordinates. For mathematical convenience, we select the center of mass of the 
nuclei as the reference point rather than the center of mass of the nuclei and 
electrons together. The difference is negligibly small. We designate the two 
nuclei as A and B , and introduce a new set of nuclear coordinates defined by 


Vi M* 

m^ a m 


(10.22a) 


R — Q b — Q a 


(10.22b) 


where X locates the center of mass of the nuclei in the external coordinate 
system, R is the vector distance between the two nuclei, and M is the sum of 
the nuclear masses 


M = M a + M b 

The kinetic energy operator Tq for the two nuclei, as given by equation 
(10.2), is 
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To = 


-h 2 / V 


V 2 

A , 

M a M Bi 


(10.23) 


The laplacian operators in equation (10.23) refer to the spaced-fixed coordi¬ 
nates Q a with components Q xa , Q va , Q :a , so that 

d 2 d 2 d 2 


V 2 = 

a 


OQ: 


d Q 2 ya 


dQ 


,2 1 

za 


a = A, B 


However, these operators change their form when the reference coordinate 
system is transformed from space fixed to center of mass. 

To transform these laplacian operators to the coordinates X and R, with 
components X x , X y , X z and R x , R y , R : , respectively, we note that 
d _ dX x d ()R X d M a d d 

d& A ~ dQ xA dx x + d& A W x - ~Mdx x ~ W x 

d dX x d dR x d M B d d 

ooTb = d q xB dx x + doTs W x = ~m ~dx x + M x 


from which it follows that 


d 2 (M a \ 2 d 2 d 2 2 M a d 2 

d<£ 4 = \ MJ + M dXj)R x 

d 2 _ (M b \ 2 d 2 d 2 2 M b d 2 

d^~ B ~ \MJ 'dX 2 x + W x + M dX x dR x 


Analogous expressions apply for Q yA , Q yB , Q zA , and Q zB . Therefore, in terms of 
the coordinates X and R, the operators V 2 , and V] s are 


v 2 „ = 




. M y 

) vy + v 2 r 

2Ma v V 

(10.24a) 

'M b ' 

) vi- + vi 

2 Mb- _ 

1 V T' - V n 

(10.24b) 

M j 

T- , , Vj-Vj 

M 


where V 2 v = V x -^x and = V K ■ V K are the laplacian operators for the 
vectors X and R and where Xx and V R are the gradient operators. When the 
transformations (10.24) are substituted into (10.23), the operator Tq becomes 



— V 2 +-V 2 
M x n R 


(10.25) 


where u is the reduced mass of the two nuclei 

11 1 
u M a M b 


or 
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MaMb 
Ma + Mg 


(10.26) 


The cross terms in Vx • V r cancel each other. 

For the diatomic molecule, equations (10.1), (10.3), (10.5), and (10.25) 
combine to give 


h 2 

2M 


V 2 x 


h 2 2 
— V 2 „ 
2/u 


ZaZb e 


'2 


R 


IR - E, 


tot 


Tfor = 0 


(10.27) 


where R = r AB is the magnitude of the vector R and where now the laplacian 
operator V 2 in H e of equation (10.4) refers to the position of electron i relative 
to the center of mass. The interparticle distances r A B = R, t'Ai, r m, and r ;/ are 
independent of the choice of reference coordinate system and do not change as 
a result of the transformation from external to internal coordinates. If we write 
'Ptot as the product 

'I'uu = 0(X)TTR, r) 


and E tot as the sum 


E to t 


— E cm + E 


then the differential equation (10.27) separates into two independent differen¬ 
tial equations 

= E cm O(X) (10.28a) 

and 


h 2 2 
■ —Vi 

2/.i 


Z A Z s e 


'2 


R 


H e — E 


*F(R, r ) = 0 


(10.28b) 


Equation (10.28b) describes the internal motions of the two nuclei and the 
electrons relative to the center of mass. Our next goal is to solve this equation 
using the method described in Section 10.1. Equation (10.28a), on the other 
hand, describes the translational motion of the center of mass of the molecule 
and is not considered any further here. 


Electronic motion and the nuclear potential function 

The first step in the solution of equation (10.28b) is to hold the two nuclei fixed 
in space, so that the operator V), drops out. Equation (10.28b) then takes the 
form of (10.6). Since the diatomic molecule has axial symmetry, the eigenfunc¬ 
tions and eigenvalues of H e in equation (10.6) depend only on the fixed value 
R of the internuclear distance, so that we may write them as ip K (r, R) and 
£ k (R). If equation (10.6) is solved repeatedly to obtain the ground-state energy 
£o (R) for many values of the parameter 7?, then a curve of the general form 
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shown in Figure 10.1 is obtained. The value of R for which £o(R) is a minimum 
represents the equilibrium or most stable nuclear configuration for the mole¬ 
cule. As the parameter R increases or decreases, the molecular energy e 0 (R) 
increases. As R becomes small, the nuclear repulsion term Vg becomes very 
large and £o(R) rapidly approaches infinity. As R becomes very large 
(R —> oo), the molecule dissociates into its two constituent atoms. We assume 
that equation (10.6) has been solved for the ground-state wave function 
tpo(r, R) and ground-state energy e 0 (R) for all values of the parameter R from 
zero to infinity. 

The potential energy function Uo(R) for the ground electronic state is given 
by equations (10.19) and (10.16) with fg = (— h 2 /2^t)V 2 R as 


U 0 (R) = £o (R) + c 00 (R) 


e 0 (R) 


fi 2 

y 


Vo(r, R)V 2 R tpo(r, R) dr 


Within the adiabatic approximation, the term cqo(R) evaluates the coupling 
between the ground-state motion of the electrons and the motion of the nuclei. 
The magnitude of this term at distances R near the minimum of £q{R) is not 
negligible 3 for the lightweight hydrogen molecule (all isotopes), the hydrogen- 
molecule ion (all isotopes), and the system He 2 . However, the general shape of 
the function Uo{R) for these systems does not differ appreciably from the 
schematic shape of £o(R) shown in Figure 10.1. For heavier nuclei, the term 
coo(R) is small and may be neglected. For these molecules the Born- 



Figure 10.1 The internuclear potential energy for the ground state of a diatomic 
molecule. 


3 See J. O. Hirschfelder and W. J. Meath (1967) Advances in Chemical Physics, Vol. XII (John Wiley and 
Sons, New York), p. 23 and references cited therein. 
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Oppenheimer and the adiabatic approximations are essentially identical. Since 
we are interested here in only the ground electronic state, we drop the subscript 
on Uq(R) from this point on for the sake of simplicity. 

The functional form of U(R) differs from one diatomic molecule to another. 
Accordingly, we wish to find a general form which can be used for all 
molecules. Under the assumption that the internuclear distance R does not 
fluctuate very much from its equilibrium value R c so that U( R) does not 
deviate greatly from its minimum value, we may expand the potential U ( R) in 
a Taylor’s series about the equilibrium distance R e 

U = U(R C ) + U a \R e )(R - R e ) + ^ U a \R e )(R - R e ) 2 
+ ^ U (3 \R e )(R - R,) 3 + i U (4) (Ro)(R ~ R) 4 + ■ ■ ■ 

where 

■ »= 1 . 2 .- 

QK R=R e 

The first derivative U^fR) vanishes because the potential U(R) is a minimum 
at the distance R e . The second derivative U (2} (R C ) is called the force constant 
for the diatomic molecule (see Section 4.1) and is given the symbol k. We also 
introduce the relative distance variable q, defined as 

q = R-R e (10.29) 

With these substitutions, the potential takes the form 

U(q) = U( 0) + \kq 2 + i[/ (3) (0)? 3 + ±U {4 X0)q 4 + ■ • • (10.30) 


Nuclear motion 

The nuclear equation (10.21) when applied to the ground electronic state of a 
diatomic molecule is 

[T q + U(R)]Xv(R) = E vXv (K) (10.31) 

where the superscript and one subscript on ^.’(R) and on are omitted for 
simplicity. In solving this differential equation, the relative coordinate vector R 
is best expressed in spherical polar coordinates R, 6, <p. The coordinate R is the 
magnitude of the vector R and is the scalar distance between the two nuclei. 
The angles 0 and cp give the orientation of the internuclear axis relative to the 
external coordinate axes. The laplacian operator is then given by (A.61) as 
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i d 
sin 9 89 



1 d 2 
sin 2 9 dtp 2 


= ——(r 2 —') 1 1 2 

R 2 8R\ 8RJ fi 2 R 2 


(10.32) 


where Lr is the square of the orbital angular momentum operator given by 
equation (5.32). With V 2 R expressed in spherical polar coordinates, equation 
(10.31) becomes 


* 2 d ( r2 8 \ 

2 qR 2 8R\ 8RJ 


1 

2/.iR 2 


L 2 + U(R) 


Xv(R, <p) 


E v Xv(R, 0, <P) 


(10.33) 


The operator in square brackets on the left-hand side of equation (10.33) 
commutes with the operator Lr and with the operator L z in (5.3 lc), because Lr 
commutes with itself as well as with L z and neither Lr nor L z contain the 
variable R. Consequently, the three operators have simultaneous eigenfunc¬ 
tions. From the argument presented in Section 6.2, the nuclear wave function 
Xv(R, 9, <p) has the form 

Xv(R, 0, q >) = F(R)Y Jm (9, <p) (10.34) 


where F(R) is a function of only the internuclear distance R, and Y Jm (9, <p ) are 
the spherical harmonics, which satisfy the eigenvalue equation 

L 2 Y Jm {9, <p) = J(J + 1 )h 2 Y Jm (9, tp) 


J = 0, 1,2,...; m = —J, —J + 1, ..., 0, ..., J — 1, J 


It is customary to use the index J for the rotational quantum number. Equation 
(10.33) then becomes 


^ d ( R 2 d A 

2/.iR 2 dR V d RJ 


J(J + 1 )fi 2 
2uR 2 


F U(R) - E v 


F(R) = 0 


(10.35) 


where we have divided through by Y Jm (9, tp). 

We next replace the independent variable R in equation (10.35) by q as 
defined in equation (10.29). Equation (10.35) has a more useful form if we also 
make the substitution S(q) = RF(R). Since dq/dR = 1, we have 


d F(R) 
dR 


1 dS(q) 
R dr/ 


Y- S{q) ’ 


d 

dR 



= R 


d 2 S{q) 
dq 2 


and equation (10.35) becomes 


ft 2 d 2 J(J+ \)fi 2 

2f.i dq 2 2/.i(R e + q) 2 


F U(q) - E 


S(q) = 0 


(10.36) 
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after multiplication by the variable R. 

The potential function U(q) in equation (10.36) may be expanded according 
to (10.30). The factor (R c + q)~ 2 in the second term on the left-hand side may 
also be expanded in terms of the variable q as follows 


1 

(i? e + q) 2 


R: 



-1 (l-* + 3 4- 

Rl \ R? Rl 


(10.37) 


where the expansion (A.3) is used. For small values of the ratio q/R e , equation 
(10.37) gives the approximation R R c . 

If we retain only the first two terms in the expansion (10.30) and let R be 
approximated by R e , equation (10.36) becomes 

~^rw~" r ^ kql ~ W)S(q) = 0 (1038) 

where 

W = E v — 6/(0) - J(J + 1 )B e (10.39) 

B e = fi 2 / 2 j.iR 2 = h 2 /2I (10.40) 

The quantity / (= uR 2 .) is the moment of inertia for the diatomic molecule 
with the internuclear distance fixed at R c and B e is known as the rotational 
constant (see Section 5.4). 

Equation (10.38) is recognized as the Schrodinger equation (4.13) for the 
one-dimensional harmonic oscillator. In order for equation (10.38) to have the 
same eigenfunctions and eigenvalues as equation (4.13), the function S(q) must 
have the same asymptotic behavior as rjj(x) in (4.13). As the internuclear 
distance R approaches infinity, the relative distance variable q also approaches 
infinity and the functions F(R) and S(q) = RF(R) must approach zero in order 
for the nuclear wave functions to be well-behaved. As R —> 0, which is 
equivalent to q —* —R e , the potential U(q) becomes infinitely large, so that 
F(R) and S(q) rapidly approach zero. Thus, the function S(q) approaches zero 
as q —^ —Re and as R —» oo. The harmonic-oscillator eigenfunctions y>(x) 
decrease rapidly in value as |x| increases from x = 0 and approach zero as 
x —► ±oo. They have essentially vanished at the value of x corresponding to 
q = —7?e- Consequently, the functions S(q) in equation (10.38) and y>(x) in 
(4.13) have the same asymptotic behavior and the eigenfunctions and eigenva¬ 
lues of (10.38) are those of the harmonic oscillator. The eigenfunctions S n (q) 
are the harmonic-oscillator eigenfunctions given by equation (4.41) with x 
replaced by q and the mass m replaced by the reduced mass u. The 
eigenvalues, according to equation (4.30), are 
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W n = (n + j)ha), n = 0, 1,2, ... 

where 

a) = \/k//u 

In this approximation, the nuclear energy levels are 

E nJ = U(0) + (n + \)h(o + J(J + 1 )B e (10.41) 

and the nuclear wave functions are 

X*Jm(R, 0, (p) = ^S„(R - Rc)Y Jm (0, <p) (10.42) 

K 


Higher-order approximation for nuclear motion 

The next higher-order approximation to the energy levels of the diatomic 
molecule is obtained by retaining in equation (10.36) terms up to q 4 in the 
expansion (10.30) of U(q) and terms up to q 2 in the expansion (10.37) of 
(R c + q)~ 2 . Equation (10.36) then becomes 

~2p d dq2 ) + [ ^ kq2 + BeAJ + 1} + V ' ]S{q) = [Ev ~ UmS(q) (10 - 43) 
where 


V = - 


2B e J(J + 1) 

R e 


<7 + 


3 B ‘ AJ + 1 f 2 +1 u»\0rf +1 W 


R: 


24 


= b { q + b 2 q 2 + h q 2 + b 4 q 4 


(10.44) 


For simplicity in subsequent evaluations, we have introduced in equation 
(10.44) the following definitions 


, 2B e J(J + 1) 
6l "-~ 

. _ 3«.../(./+ I) 

02 = - 5 - 

b 3 = l - 1/ (3) (0) 

6 

b 4 = ^- u (4 H0) 


(10.45a) 

(10.45b) 

(10.45c) 

(10.45d) 


Since equation (10.43) with V = 0 is already solved, we may treat V as a 
perturbation and solve equation (10.43) using perturbation theory. The unper¬ 
turbed eigenfunctions S^\q) are the eigenkets | n) for the harmonic oscillator. 
The first-order perturbation correction E (4 j to the energy E nJ as given by 
equation (9.24) is 



10.2 Nuclear motion in diatomic molecules 


277 


= {n\ V'\n) = bi(n\q\n) + b 2 (n\q 2 \n ) + 6 3 («|^ 3 |n) + b 4 (n\q 4 \n) 

(10.46) 

The matrix elements (n\q k \n) are evaluated in Section 4.4. According to 
equations (4.45c) and (4.50e), the first and third terms on the right-hand side of 
(10.46) vanish. The matrix elements in the second and fourth terms are given 
by equations (4.48b) and (4.51c), respectively. Thus, the first-order correction 
in equation (10.46) is 

^ =h ]L l '" + ^ +b A{^) (" !+ " + j) 




+ 2 ) 2+ 4 


(10.47) 


where equations (10.40), (10.45b), and (10.45d) have been substituted. 

Since the perturbation corrections due to b\q and b 2 q 2 vanish in first order, 
we must evaluate the second-order corrections E^J in order to find the 
influence of these perturbation terms on the nuclear energy levels. According 
to equation (9.34), this second-order correction is 

f ( 2 ) y- (^19 + b 2 q 3 \n) 2 

nJ 2 -j 77(0) 77(0) 

k&n) h k ~ h n 


2 \ - ( k\q\n) 2 \ - {k\q\n)(k\q 3 \n) 

l ^(k-n)hco 1 3 ^ (k — n)hco 


b l E 


W\n) 2 

(k — n)hco 


k(^n) y ' * k{i^ri) v 

(10.48) 

where the unperturbed energy levels are given by equation (4.30). The matrix 
elements in equation (10.48) are given by (4.45) and (4.50), so that E (2 j 
becomes 


7(2) 
J nJ 


E 

~{n 

+ 1 )h 

nh 

hco 

- 

2/uco 

2 /.ico 

2b\b 2 

((n + 1 )h\ 1/2 


hco 


(n + 1 )h\ l/2 ^ /(n + l)fr\ 3 ^ 2 / nh \ 1//2 ^ / nh \ 3 ^ 2 

2 fio) ) \ 2 q.(o J \2jUO) J \2/.ico J 


h 


hco \2uo) 


( n +lX' l +2)C + 3) + 9(B+ i)3 


-9n b - 


3 n(n — 1 )(n — 2) 


This equation simplifies to 
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p(2) _ _ 


b\ 


, .. b\h 2 9 

(« + i) - o ^ 4 (30»~ + 30« + 11) 


2/uu 2 pRco 3 v 27 8^ 3 o» 4 

Substitution of equations (10.40), (10.45a), and (10.45c) leads to 


E d) = _ 

nJ h 2 (0 2 


J 2 {J+ 1) 2 4 


2B 2 e R e U {3) (0) 
ptha) 3 


(n J(J + 1 ) 


h 2 [U (3 \ 0)] 2 
288/r 3 &> 4 


30 (w +| 


(10.49) 


The nuclear energy levels in this higher-order approximation are given to 
second order in the perturbation by combining equations (10.41), (10.47), and 
(10.49) to give 


E n J 


E ( „j + E 


(i) 

nJ 


+ E 


( 2 ) 

nJ 


— (0) -(- ho) (/? T 1) — fllQX e (m T ~\~B e J (,/ T 1) 

- DJ 2 (J + l) 2 - a e (« + i) J(J + 1) (10.50) 

where we have defined 




D = 


a e = 


4S 2 


_ e 


h 2 ct) 2 

-6Bl 

hat 




3 /.to) 2 J 


(10.51a) 

(10.51b) 

(10.51c) 

(10.51d) 


The approximate expression (10.50) for the nuclear energy levels E nJ is 
observed to contain the initial terms of a power series expansion in (n +1) and 
J(J + 1). Only terms up to (n +1) 2 and [./(./ + l)] 2 and the cross term in 
(n + \)J(J + 1) are included. Higher-order terms in the expansion may be 
found from higher-order perturbation corrections. 

The second term on the right-hand side of equation (10.50) is the energy of a 
harmonic oscillator. Since the factor x e in equation (10.51a) depends on the 
third and fourth derivatives of the intemuclear potential at R c , the third term in 
equation (10.50) gives the change in energy due to the anharmonicity of that 
potential. The fourth term is the energy of a rigid rotor with moment of inertia 
/. The fifth term is the correction to the energy due to centrifugal distortion in 



Problems 


279 


this non-rigid rotor. As the rotational energy increases, the internuclear 
distance increases, resulting in an increased moment of inertia and conse¬ 
quently a lower energy. Thus, this term is negative and increases as / increases. 
The magnitude of the centrifugal distortion is influenced by the value of the 
force constant k as reflected by the factor co 2 in D. The last term contains both 
quantum numbers n and / and represents a direct coupling between the 
vibrational and rotational motions. This term contains two contributions: a 
change in vibrational energy due to the centrifugal stretching of the molecule 
and a change in rotational energy due to changes in the internuclear distance 
from anharmonic vibrations. The constant term ?/(()) merely shifts the zero- 
point energy of the nuclear energy levels and is usually omitted completely. 

The molecular constants co, B e , x e , D , and a e for any diatomic molecule may 
be determined with great accuracy from an analysis of the molecule’s vibra¬ 
tional and rotational spectra. 4 Thus, it is not necessary in practice to solve the 
electronic Schrodinger equation (10.28b) to obtain the ground-state energy 
£o (R). 


Problems 

10.1 Derive equation (10.47) as outlined in the text. 

10.2 Derive equation (10.49) as outlined in the text. 

10.3 Derive equation (10.50) as outlined in the text. 

10.4 An approximation to the potential U(R) for a diatomic molecule is the Morse 
potential 

U(R) = —De(2e~ fl( *~' R ' ) - e la(R Rc> ) = -D e {2sT aq - e~ 2aq ) 
where a is a parameter characteristic of the molecule. The Morse potential has 
the general form of Figure 10.2. 

(a) Show that U(R e ) — —D e , that (/(oo) — 0, and that U (0) is very large. 

(b) If the Morse potential is expanded according to equation (10.30), relate the 
parameter a to u, oo, and D e 

(c) Relate the quantities x e , a e , and 77 (0) in equation (10.50) to u, co, and D e 
for the Morse potential. 

10.5 Another approximate potential U(R) for a diatomic molecule is the Rydberg 
potential 

U(R) = -D e [l + b(R - TAfle- 6 ^^ = —Z> e (l + bq)e hq 
where b is a parameter characteristic of the molecule. 

(a) Show that U(R C ) — —D e , that U( oo) = 0, and that (7(0) is veiy large. 

4 Comprehensive tables of molecular constants for diatomic molecules may be found in K. P. Huber and G. 
Herzberg (1979) Molecular Spectra and Molecular Structure: IV. Constants of Diatomic Molecules (Van 
Nostrand Reinhold, New York). 
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Figure 10.2 The Morse potential for the ground state of a diatomic molecule. 


(b) If the Rydberg potential is expanded according to equation (10.30), relate the 
parameter b to u, (», and D e . 

(c) Relate the quantities x e , a e , and '?/ (0) in equation (10.50) to u, a>, and D c 
for the Rydberg potential. 

10.6 Consider a diatomic molecule in its ground electronic and rotational states. Its 
energy levels are given by equation (10.50) with J — 0. The value of U(R ) at 
R — Re is — D e . 

(a) If the anharmonic factor x e is positive, show that the spacing of the energy 
levels decreases as the vibrational quantum number n increases. 

(b) When the vibrational quantum number n becomes sufficiently large that the 
difference in energies between adjacent levels becomes zero, the molecule 
dissociates into its constituent atoms. By setting equal to zero the derivative 
of E „o with respect to n, find the value of n in terms of x e at which 
dissociation takes place. 

(c) Relate the well depth D e to the anharmonic factor x e and compare with the 
corresponding expressions in problems 10.4 and 10.5. 
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Useful power series expansions 


Binomial expansions 


( u + v Y— 


H! 


( U +vr n = Yl 


i^a\{n-a)\ 

_ ^ ( — !)”(« + a - jj! („+g) 


a=0 


a!(n — 1)! 

d n uv _ ^ n\ d a ud n - a v 

a!(« _ a j\ d z a d z H-a 


dr” 


(A.l) 


(A.2) 
(A-3) 
(A.4) 


Useful integrals 


dz == 2 


~ dz = 7T 1 / 2 


ze ” dz — 4 


z 2 e z " dz = 2 


o 


z 2 e ■’ dz =? 2jr 1,/2 


1/2 

: dz =-e“ 

a 


-a 2 z 2 +bz a _' a b 2 /4a 


-az 2 Q c(b-zf ^ Q acb 2 /(a-c) 

V a — C 


ro ° sin 2 z 


dz = 2 T 


(A.5) 
(A-6) 
(A.7) 
(A-8) 
(A-9) 
(A. 10) 
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e ips 2ji k 

, (I+u)^' dS= t! pe 


COS H0d0 == 2 jt, ?! = 0 

T 

— 0, ?! = 1, 2, 3, 


sin ?20 d0 = 0, 

T 

cos ???0 cos nd Ad — 2 

r 

sin m6 sin nddd = 2 


n = 0, 1, 2, ... 


cos mO cos ?!0d0 = Jid n 


sin m6 sin nOdd = Ji6 ri 


cos ???0sin nOdO — 0 


sin 2 0 d6 — id — 4sin 26 — Ud — sin 6 cos 6) 


6 sir? Odd = \d 2 — \6sm26 — g cos 26 


6~ sin" # d6 — Id 3 — i(0" — 4)sin 26 — id cos 26 


sin 3 6 dd — j cos 3 6 — cos 6 — — | cos 6 + A cos 3 6 


sin 5 0d0 = — | cos 6 + Acos 30 — A C os 56 


sin 7 0d0 = — g|cos0+ §5cos30 - 350cos50 + 455cos 16 


sin kd sin nd dd — 


sin(A: — n)6 sin(A: + n)Q 
2 (k — n) 2 (k + n) 


(k 2 ^ n 2 ) 


Integration by parts 


u dv — uv — 


vdu 


dv(x) 

u(x) —-—- dx — u(x)v(x) — 
dx 


du(x) 

f(x) —-— dx 
dx 


(A. 11) 

(A. 12) 
(A. 13) 

(A. 14) 

(A. 15) 

(A. 16) 

(A. 17) 

(A. 18) 

(A. 19) 

(A.20) 

(A.21) 

(A.22) 

(A.23) 


(A.24) 
(A.25) 


Gamma function 


n?!) 


/*oo 


r(«) = 

Jo 

r(?i + 1 ) 
= (?! - 1)!, 


z"-‘e- z dz 

*= ?ir(?7) 

n = integer 


(A. 26) 

(A.27) 
(A.28) 
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— OO 


r(i) = jt 1/2 

_(2„-l)/2 e -, d . = r 

0 


2n + 1 


283 
(A.29) 
(A.30) 


Trigonometric functions 


e‘ e = cos 0 + i sin 0 

(A.31) 

cos 0 = i(e i<? + e~ w ) 

(A.32) 

sin 0 — jj(e ie — e~ ie ) 

(A.33) 

sin 2 0 + cos 2 6 s= 1 

(A.34) 

cos(0 + tp) — cos 0 cos (p — sin 0 sin cp 

(A.35) 

sin(0 + (p) — sin 0 cos tp + cos 0 sin cp 

(A.36) 

cos 2 0 = cos 2 0 — sin 2 0 

(A.37) 

sin 26 — 2sin0cos 0 

(A.38) 

sin 30 = 3sin 6 — 4 sin 3 0 

(A.39) 

sin 50 = 5 sin 0 — 20 sin 3 0 + 16sin 5 0 

(A.40) 

70 = 7sin 0 — 56sin 3 0 + 112sin 5 0 — 64sin 7 0 

(A.41) 

— cos 0 = —sin 0 
d0 

(A.42) 

d 

— sin 0 = cos 0 
d0 

(A.43) 

-^-sin _1 z = arcsinz = (1 — z 2 ) -1 / 2 

dz dz 

(A.44) 


Hyperbolic functions 

cosh 0 = i(e e + e~ e ) 

(A.45) 

sinh 0 = i(e e — e~ e ) 

(A.46) 

coshi0 == cos 0 

(A.47) 

sinh i0 = i sin 0 

(A.48) 

cosh 2 0 — sinh 2 0 = 1 

(A.49) 

. „ sinh 0 
tanh 0 = —— 
cosh 0 

(A.50) 

sinh 29 — 2sinh 0 cosh 0 

(A.51) 

dcosh0 . 

M = 511,1,0 

(A.52) 

d sinh 0 „ 

——— = cosh 0 
dc/ 

(A.53) 

d tanh 0 1 

d0 cosh 2 0 

(A.54) 
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Schwarz’s inequality 


For z — x + iy, |z| 2 ; 

I \ a{x)\ 2 Ax 


\a(x) 2 dx 

\b(x) 2 dx 3= 

a*(x)b(x) dx 

Imz 2 ; since Imz — z — z 

’*/2i, |z| 2 - 


„*|2 


|b(x)| 2 Ax 


\a*' (x)b(x) — a(x)b* (x)] Ax 


Vector relations 

A B = AB cos 9 
| A X B| = AB sin 0 
9 is the angle between A and B 


Spherical coordinates ( r , 6, cp) 

x — r sin 9 cos <p, y — r sin 0 sin cp, 
At — r 2 sin 9 dr A9 Acp 


V 2 ip 


1 d ( 2 dx). A 1 d ( . dy.’\ 
r 2 dr \ dr) ^ r 2 sin 9d9 \ d9) 


z—r cos 9 
l d 2 y.> 

r 2 sin 2 0 dcp 2 


Plane polar coordinates (r, cp ) 


x = r cos cp, 
_ 2 1 d 

^' P = l-Wr 


y — r sin cp 
dip\ 1 d 2 xl> 
dr) ^ r 2 dcp 2 


(A.55) 

(A. 5 6) 


(A.57) 
(A. 5 8) 


(A.59) 
(A.60) 

(A.61) 

(A. 62) 
(A.63) 
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Fourier series 

An arbitraiy function / (6) which satisfies the Dirichlet conditions can be expanded as 

OO 

f(9) = ^- + cos n9 + b n sin nO) (B.l) 

where 9 is a real variable, n is a positive integer, and the coefficients a„ and b„ are 
constants. The Dirichlet conditions specify that f(9) is single-valued, is continuous 
except for a finite number of finite discontinuities, and has a finite number of maxima 
and minima. The series expansion (B.l) of the function f(0) is known as a Fourier 
series. 

We note that 

cos n(9 + 2ji) — cos nO 
sin n(9 + 2n) = sin nO 

so that each term in equation (B.l) repeats itself in intervals of 2k. Thus, the function 
f(0) on the left-hand side of equation (B.l) has the property 

f{e+2jt)=m 

which is to say, f(0) is periodic with period 2 ji. For convenience, we select the range 
— ji 0 =£ jt. for the period, although any other range of width 2 ji is acceptable. If a 
function F(<p) has period p, then it may be converted into a function f(0) with period 
2 ji by introducing the new variable 0 defined by 0 = 2Jtcp / p, so that 
f(9) = F(2jzcp/p). If a non-periodic function F(0) is expanded in a Fourier series, the 
function f(0) obtained from equation (B.l) is identical with F(0) over the range 
—ji 9 =£ ji, but outside that range the two functions do not agree. 

To find the coefficients ci n and b„ in the Fourier series, we first multiply both sides 
of equation (B.l) by cos m.9 and integrate from —jt to n. The resulting integrals are 
evaluated in equations (A. 12), (A. 14), and (A. 16). For n = 0, all the integrals on the 
right-hand side vanish except the first, so that 

•it 

f (9) d9 — — X 2jt — jia 0 

J — JT 2 

For m > 0, all the integrals on the right-hand side vanish except for the one in which 
n — m, giving 
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f(d )cos md d$ = na m 

Tt 

If we multiply both sides of equation (B.l) by sin mO, integrate from —it to it, and 
apply equations (A. 13), (A.15), and (A.16), we find 

/(0)sin mddd — itb m 

Tt 

Thus, the coefficients in the Fourier series are given by 


1 

ci n — 

it 


f(8 )cos nOdO, 


n = 0, 1, 2, ... 


(B.2a) 


b n — 

it 


/(@)sin nddd, 


n~ 1,2, 


(B.2b) 


In deriving these expressions for a„ and b„, we assumed that f(0) is continuous. If 
f(9) has a finite discontinuity at some angle ()<> where —it < do < it, then the expres¬ 
sion for a„ in equation (B.2a) becomes 

1 f e ° 1 

f(9 )cos nddd + — 


(In 

it 


it 


f(9 )cos n9d8 


A similar expression applies for b„. The generalization for a function f(9) with a finite 
number of finite discontinuities is straightforward. At an angle 9q of discontinuity, the 
Fourier series converges to a value of / (8) mid-way between the left and right values 
of f(9) at 6 0 ; i.e., it converges to 

lim l[f(8 0 -e) + f(9 0 + e)\ 

The Fourier expansion (B.l) may also be expressed as a cosine series or as a sine 
series by the introduction of phase angles a„ 


f{9) = y + ^2 c n cos (nd + a„) 


= ^ c' n sin(n0 + a' n ) 


(B.3a) 

(B.3b) 


n =0 


where c„, c'„, a n , a'„ are constants. Using equation (A.35), we may write 
c n cos (n9 + a„) — c n cos nd cos a n — c„ sin n9 sin a n 

If we let 


Cl n — C n COS CL n 

b n — —c n since,, 

then equations (B.l) and (B.3a) are seen to be equivalent. Using equation (A.36), we 
have 

c'„ sin (nd + a'„) — c'„ sin nd cos a'„ + c'„ cos nd sin a' n 

Letting 

a 0 — 2c'o sinao 

a„ — c'„ sin a'„, n > 0 

b„ — c' n cos a' n , «>0 
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we see that equations (B.l) and (B.3b) are identical. 


Other variables 

The Fourier series (B.l) and (B.3) are expressed in terms of an angle 6. Flowever, in 
many applications the variable may be a distance x or the time t. If the Fourier series is 
to represent a function /(x) of the distance x in a range — / =£ x l, we make the 
substitution 


e = ^ 


in equation (B.l) to give 


x a o ( tmx , . njtx\ m , s 

fix) = — + 2^ ( a„ cos — + b n sin — j (B .4) 

n= 1 ^ ' 


with a„ and b n given by 


/ 


mix 


-t 


f(x) cos —— dx, n — 0, 1,2,.. 


A -I'' 
- / 


njtx 

/(x)sin —— dx, 1, 2, 

-/ l 


(B.5a) 

(B.5b) 


If time is the variable, then we may make either of the substitutions 

„ 2jit 

6 — -= cot 

P 

where p is the period of the function f(t) and co is the angular frequency, so that 
equation (B.l) becomes 


/(0 = y + ( Qn 


n =1 


2jint . 2jint 
, cos-1- b n sm — 


a o \—' , 

= — + } \a„ cos ncot + b n sm ncot ) 

(B.6) 


n= 1 


The constants a„ and b n in equations (B.2) for the variable t are 


2 

•p! 2 

2nnt 


O) 

a n 

/(Ocos 

At - 


P 

-p! 2 

P 


jt 

, 2 

'p/ 2 

2 jint 


0) 

bn — 

/(Osin 


At - 


P. 

-p/2 

P 


jt 


^njoj 

—71 j (J) 

•jt/co 

—71 j (1) 


f(t) cos mot At 


/(r)sin ncot At 


(B.7a) 

(B.7b) 


Complex form 

The Fourier series (B.l) can also be written in complex form by the substitution of 
equations (A.32) and (A.33) for cos n() and sin nO, respectively, to yield 

OO 

m = E c « e ‘" e ^- 8 ) 

n =—oo 


where 
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a n - i b„ 


2 

a n + i b„ 
2 


n> 0 

n> 0 


(B.9) 


a 0 


The coefficients c„ in equation (B.8) may be obtained from (B.9) with a„ and b„ given 
by (B.2). The result is 

c„=^-f f(9)c- ind dG (B. 10) 

2^ J-Jt 

which applies to all values of n, positive and negative, including n = 0. We note in 
passing that c „ is the complex conjugate c* of c n . 

In terms of the distance variable x, equations (B.8) and (B.IO) become 


fix) = c » e ‘ 


,i njzx/1 


° n 21 


f(x)e- inxx !' dx 


while in terms of the time t, they take the form 

OO 

mm Y, c " elnajt 


co 
2 71 


n =—oo 
•jt/oj 


—7l/(D 


f(t)e- lnw, dt 


(B .11) 
(B.12) 

(B.13) 

(B.14) 


Parseval’s theorem 

We now investigate the relation between the average of the square of / (0) and the 
coefficients in the Fourier series for f(9). For this purpose we select the Fourier series 
(B.8), although any of the other expansions would serve as well. In this case the 
average of \f{0)\ 2 over the interval — ji ^ 6 =£ n is 


1 

2 jt 


T \m\ 2 de 

—Jt 


The square of the absolute value of f(6) in equation (B.8) is 

OO 12 OO OO 


\m \ 2 = 


E 


C n Q 


i nO 


E E 

m =—oo n =—oo 


r * r J(n-m)6 


(B. 15) 


where the two independent summations have been assigned different dummy indices. 
Integration of both sides of equation (B.15) over 6 from —jt to n gives 

'JT OO OO eJt 

\m\ 2 de= y Y c >- e ' ( ” m)ddd 

—Jt m =—oo n =—oo 

Since m and n are integers, the integral on the right-hand side vanishes except when 
m—n, so that we have 



289 


Fourier series and Fourier integral 
•n 

e' ( n- m )° d 0 = 2jt 6 mn 

—71 

The final result is 

1 PJt OO 

— |/(0)| 2 &0 — Y, k«| 2 (B. 16) 

—71 n =—oo 

which is one form of Parseval’s theorem. Other forms of Parseval’s theorem are 
obtained using the various alternative Fourier expansions. 

Parseval’s theorem is also known as the completeness relation and may be used to 
verify that the set of functions c'"° for —oo oo are complete, as discussed in 

Section 3.4. If some of the terms in the Fourier series are missing, so that the set of 
basis functions in the expansion is incomplete, then the corresponding coefficients on 
the right-hand side of equation (B.16) will also be missing and the equality will not 
hold. 


Fourier integral 

The Fourier series expansions of a function /(x) of the variable x over the range 
— / x l may be generalized to the case where the range is infinite, i.e., where 
—oo *£ x oo. By a suitable limiting process in which l —r oo, equations (B.ll) and 
(B.12) may be extended to the form 

1 f°° 

/(/) = -= g(k)e ikx dk (B.17) 

V 2.JZ —oo 

g(k) = ~^= P /(x)e- ifa dx (B. 18) 

Equation (B.17) is the Fourier integral representation of fix). The function g(k ) is the 
Fourier transform of f(x), which in turn is the inverse Fourier transform of g(k). 

For any function / (x) which satisfies the Dirichlet conditions over the range 
—oo s; v ^ oc and for which the integral 

•OO 

|/(x)| 2 dx 

— OO 

converges, the Fourier integral in equation (B.17) converges to /(x) wherever /(x) is 
continuous and to the mean value of /(x) at any point of discontinuity. 

In some applications a function /(x, t), where x is a distance variable and t is the 
time, is represented as a Fourier integral of the form 

i r°° 

fix, t) = -= 6(A-)e i|tv '• vU) ' 1 dk (B.19) 

v t — oo 

where the frequency <o(k) depends on the variable k. In this case the Fourier transform 
g(k) takes the form 

g(k) = G{k)e- 10j(k)t 
and equation (B. 18) may be written as 

i r°° 

G(k) = —= fix, t)e dx (B.20) 

v t —oo 

The functions fix, t) and G(k) are, then, a generalized form of Fourier transforms. 
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Another generalized form may be obtained by exchanging the roles of x and t in 
equations (B. 19) and (B.20), so that 

1 f°° 

~ ~‘' 1 ' (B.21) 


fix, t) = 


G(c w) 


y/2n 


1 


\72tt 


-OO 

/(x, t)e- W(0)x - mt] dt 


(B.22) 


Fourier integral in three dimensions 

The Fourier integral may be readily extended to functions of more than one variable. 
We now derive the result for a function / (x, y, z) of the three spatial variables x, y, z. 
If we consider fix, y, z) as a function only of x, with y and r as parameters, then we 
have 

1 fOO 


f(x, y, z) = 
giikx, y, z) = 


\flit 

1 

\fl7t 


gi(k x , y, z)e lkxX dk : 


— OO 
POO 


fix, y, -)e lkxX dx 


(B.23a) 


(B.23b) 


We next regard gi{k x , y, z ) as a function only of y with k x and z as parameters and 
express gi(k x , y, z) as a Fourier integral 

1 fOO 


giik x , y, z) = 


gi(k x , ky, z ) = 


\/2jt 


\Jln 


giikx, ky, z)e' kry d k y 

giikx, y, z)e~' kyy dy 


Considering giik x , k v , z) as a function only of z, we have 


gi(k x , k y , z) = 


1 


g(k x , ky, k z )e' k - z dk : 


gik x , k v , k z ) 


\j2n 

] fOO 


giikx, ky, z)e ' kzZ dz 


,v >; ? z ' ^— ' SZ\ ,V X9 ,v y 

Combining equations (B.23a), (B.24a), and (B.25a), we obtain 

OO 

1 


fix, y, z) 


g{k x , k y , k k )e' ikxX+kyy+kzZ) dk x dk y dk z 


(2tt) 3 / 2 , 

— OO 

Combining equations (B.23b), (B.24b), and (B.25b), we have 

OO 

1 


gik x , ky, kf) 


(2jt) 3 / 2 


fix, y, z)e- i(kxX+kyy+kzZ) dxdvdz 


(B.24a) 

(B.24b) 

(B.25a) 

(B.25b) 

(B.26a) 

(B.26b) 


If we define the vector r with components x, y, z and the vector k with components 
k x , ky, k z and write the volume elements as 

dr = dx d y dz 


dk Sj dk x dky dk z 


then equations (B.26) become 
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1 


m = 


g( k) 


(2 jr ) 3 / 2 

1 

(2jt) 3 / 2 


g(k)e ik ' r dk 
/(r)c lk ' r dr 
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(B.27a) 

(B.27b) 


ParsevaVs theorem 

To obtain Parseval’s theorem for the function/(x) in equation (B.17), we first take the 
complex conjugate of f(x) 


fix) 


1 

\fldX 


g*(k')e~ ik x dk' 


where we have used a different dummy variable of integration. The integral of the 
square of the absolute value of / (x) is then given by 


|/(x)| 2 dx 

— OO 


fix)fix) dx 

— OO 


OO 

l r 

2 71 , , 

— OO 


g*(k')g(k)e (k - k ' )x dkdk'dx 


The order of integration on the right-hand side may be interchanged. If we integrate 
over x while noting that according to equation (C.6) 

•OO 

e i(k~ k ') x d T — 2nd(k — k') 

— OO 

we obtain 


|/(x)| 2 dx 

— OO 


OO 

g*(k')g{k)d(k- k')dkdk' 

—OO 


Finally, integration over the variable k' yields Parseval’s theorem for the Fourier 
integral, 

° |/(x)| 2 dx = P \g(k)\ 2 dk (B.28) 

—oo —OO 

Parseval’s theorem for the functions /(r) and g(k) in equations (B.27) is 

|/(r)| 2 dr= [|g(k)| 2 dk (B.29) 


This relation may be obtained by the same derivation as that leading to equation 
(B.28), using the integral representation (C.7) for the three-dimensional Dirac delta 
function. 
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Dirac delta function 


The Dirac delta function d(x) is defined by the conditions 

d(x) = 0, for x f 0 
= oo, for x — 0 


such that 


(C.l) 


d(x) dx = 1 (C.2) 

J —OO 

As a consequence of this definition, if / (x) is an arbitrary function which is well- 
defined at x — 0 , then integration of /(x) with the delta function selects out the value 
of /(x) at the origin 

/ (x)<5(x) dx — f (0) (C.3) 

The integration is taken over the range of x for which /(x) is defined, provided that the 
range includes the origin. It also follows that 

f(x)d(x - x 0 ) dx = f{x o) (C.4) 

since d(x — x 0 ) = 0 except when x = x 0 . The range of integration in equation (C.4) 
must include the point x — x 0 . 

The following properties of the Dirac delta function can be demonstrated by 
multiplying both sides of each expression by /(x) and observing that, on integration, 
each side gives the same result 


d(— x) = d(x) 

dicx) — — d(x), c real 

c 

(C.5a) 

(C.5b) 

xd{x — xo) = xqc>(x — xo) 

(C.5c) 

o 

II 

3 

X 

(C.5d) 

f(x)d(x - x 0 ) = /(xo)d(x - x 0 ) 

(C.5e) 

As defined above, the delta function by itself lacks mathematical rigor and has no 
meaning. Only when it appears in an integral does it have an operational meaning. 

That two integrals are equal does not imply that the integrands 

are equal. However, for 

the sake of convenience we often write mathematical expressions involving <5(x) such 
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as those in equations (C.5a-e). Thus, the expressions (C.5a-e) and similar ones 
involving d(x) are not to be taken as mathematical identities, but rather as operational 
identities. One side can replace the other within an integral that includes the origin, for 
(5(0), or the point x 0 for d(x — x 0 ). 

The concept of the Dirac delta function can be made more mathematically rigorous 
by regarding ()(x) as the limit of a function which becomes successively more peaked 
at the origin when a parameter approaches zero. One such function is 

(5(x) = lim ' c A ’ 2 / f;2 
£—>0 JT l l 2 £ 

since 


and 


7r*/ 2 £ 


- x2 ! £l dx = 1 


1 _ r 2 / 2 

-e ' ' —>oo as x —► 0, e0 
£ 


0 as x —> ±oo 


Equation (C.3) then becomes 
lim 


1 


/(x)e x2/f 2 dx = /(O) 


o jr!/2 e 

Other expressions which can be used to define (5(x) include 

1 £ 


and 


lim . 

£—>0 JIX Z + £ Z 


lim—e“ w/£ 
£—>0 2 £ 


The delta function is the derivative of the Heaviside unit step function H{x), defined 
as the limit as e —> 0 of H(x, £) (see Figure C.l) 


H{x, £) — 0 


for x < — 
2 


x 1 

: e + 2 


for — 


—£ 
~2 


£ 

X - 
2 


Thus, in the limit we have 


= 1 


for x > - 
2 


H(x) = 0 for x < 0 
= l for x = 0 


= 1 for x > 0 


and AH/ dx, which equals <5(x), satisfies equation (C.l). The differential d H (x, t) 
equals dx/e for x between —e/2 and e/2 and is zero otherwise. If we take the integral 
of (3(x) from — oo to oo, we have 


<5(x) dx 


d H — lim 

£—>0 


AH{x, e) = 


r*/ 2 


1 


1 /e 


-dx = - ;±;)=1 

-c/2 e £ \2 2, 
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Figure C.l The Heaviside unit step function H(x ), defined as the limit as e —* 0 of 
H(x, e). 


and condition (C.2) is satisfied. 

We next assume that the derivative d'(x) of d(x) with respect to x exists. If we 
integrate the product f(x)d’(x) by parts and note that the integrated part vanishes, we 
obtain 


•OO 

f(x)d'(x) dx 

— OO 


*oo 

/'(x)d(x)dx = -/'(0) 

—OO 


where f'(x) is the derivative of fix). From equations (C.5a) and (C.5d), it follows that 


d'(—x) = <5'(x) 


x<5'(x) — —d(x) 

We may also evaluate the Fourier transform d(k) of the Dirac delta function 
d(x - x 0 ) 


<Xk) ■= 


1 


\/2n 


d(x — x 0 )e lkx dx 


1 


y/2n 




The inverse Fourier transform then gives an integral representation of the delta 
function 


d(x - x 0 ) = 


1 


x/2tt 


d(k)e ikx dk = ^~ 
2jz 


JHx-xo) dk 


(C.6) 


The Dirac delta function may be readily generalized to three-dimensional space. If r 
represents the position vector with components x, y, and z, then the three-dimensional 
delta function is 


<5(r - r 0 ) = d(x - x 0 )d(y - y 0 )d(z - z 0 ) 
and possesses the property that 
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where the range of integration includes the points x 0 , y (h and z 0 . The integral 
representation is 

d(r-ro) = -M°° e' k(r r <»dk 
(2 jt) j J_oo 

where k is a vector with components k x , k v , and k z and where 

dk = d k x d k y d k z 


(C.7) 
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Hermite polynomials 


The Hermite polynomials //„(£) are defined by means of an infinite series 
expansion of the generating function g(g, s), 


g(i, s) = e 2 ^ 2 = e^ ( ^ 2 = Y, H,m- 

n =0 


(D.l) 


where — oo ^ ^ oo and where .v < 1 in order for the Taylor series expansion to 
converge. The coefficients H n (g) of the Taylor expansion are given by 


= 


d”g(M, s) 


ds n 


5=0 


=ee % ,e ^ efi 


(D.2) 


5=0 


For a function f(x + y) of the sum of two variables x and y, we note that 

'df\ (dp 


dx 


dy 


Applying this property with x — s and y — —£ to the «th-order partial derivative in 
equation (D.2), we obtain 

£)n i n 

= (-ir-L(c- f - ef ) 

\s=0 

and equation (D.2) becomes 


— (e- (i_l)2 ) 

ds » 1 ’ 




- ( - 1)n dF e 


-? 2 




(D.3) 


Another expression for the Hermite polynomials may be obtained by expanding 
g(£, s) using equation (A.l) 

_ „*(2£-i) _ ^(2g ~ S) k 

k\ 


g(L s) = - Y'- 


k =0 


Applying the binomial expansion (A.2) to the factor (2£ - s) k , we obtain 




and gQ, s) takes the form 
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g(£, *) = YZ 


(-l) a 2 k - a £ k - a s 


k+a 


k= 0 a=0 a '^ _ a ^' 

We next collect all the coefficients of s n for an arbitrary n, so that k + a — n, and 
replace the summation over k by a summation over n. When k — n, the index a equals 
zero; when k — n — 1, the index a equals one; when k — n — 2, the index a equals 
two; and so on until we have k — n — M and a — M. Since the index a runs from 0 to 
k so that a =£ k, this final term gives M ss n — M or M n/2. Thus, for k + a — n, 
the summation over a terminates at a — M with M — nj 2 for n even and M — 

(n — l)/2 for n odd. The result of this resummation is 


M 


g&> = I] Yj 


(— 1)“2 


i—2atn—2a 


.. =0 a=0 a! ( n — 2a )! 

Since the Hermite polynomial H„® divided by n\ is the coefficient of s n in the 
expansion (D.l) of g(£, s), we have 

M t i \a 

J v L ' tn-2a 


H„® = 2” 


2 2a a!(n — 2a)! 


(D.4) 


We note that H„('£,) is an odd or even polynomial in £ according to whether n is odd or 
even and that the coefficient of the highest power of £ in H„( '£,) is 2". 

Expression (D.4) is useful for obtaining the series of Hermite polynomials, the first 
few of which are 


H 0 ® = 1 

Hi® - 8£ 3 - 12£ 

Hi® = 2^ 

H 4 ® = 16£ 4 - 48£ 2 + 12 

H 2 ® = 4£ 2 - 2 

H s ® = 32£ 5 - 160£ 3 + 120£ 


Recurrence relations 

We next derive some recurrence relations for the Hermite polynomials. If we 
differentiate equation (D.l) with respect to s, we obtain 


2(£ - s)e 2Ss - s2 


E H «® 

n=\ 


(n - 1)! 


The first term (n — 0) in the summation on the right-hand side vanishes because it is 
the derivative of a constant. The exponential on the left-hand side is the generating 
function g(£, s ), for which equation (D. 1) may be used to give 


2« 

n=0 n= 1 ^ 


~,n— 1 


(n - 1)! 


Since this equation is valid for all values of s with ,v| < 1, we may collect terms 
corresponding to the same power of 5, for example s n , and obtain 

2 £H„® 2Jk^® = kh+m 

n\ (n — 1)! n\ 


or 

H n+ 1 (|) - 2 £H n ® + 2nH„_ l ® = 0 (D.5) 

This recurrence relation may be used to obtain a Hermite polynomial when the two 
preceding polynomials are known. 
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Another recurrence relation may be obtained by differentiating equation (D.l) with 
respect to £ to obtain 

2 se 2 ^ 2 = f^ dHnS " 


n =0 


d£ n\ 


Replacing the exponential on the left-hand side using equation (D.l) gives 


2s E*-®i=E 


> d H„ s" 


... d£ n\ 

n =0 n =0 ^ 

If we then equate the coefficients of s", we obtain the desired result 

—nr — 2nH„-\(§) 

The relations (D.5) and (D.6) may be combined to give a third recurrence relation. 
Addition of the two equations gives 


(D.6) 


H n+ 1©- - - j HJf) 

With this recurrence relation, a Hermite polynomial may be obtained from the 
preceding polynomial. By applying the relation (D.7) to H n (£,) k times, we have 

k 


Hn +k g) = ( 2£- H„(g) 


(D.7) 


(D-8) 


Differential equation 

To find the differential equation that is satisfied by the Hermite polynomials, we first 
differentiate the second recurrence relation (D.6) and then substitute (D.6) with n 
replaced by n - 1 to eliminate the first derivative of H n \(f) 


d 2 H n d H n 
= 2 n- 


1 = 4 n(n - 


d £ 2 d£ 

Replacing n by n — 1 in the first recurrence relation (D.5), we have 
H n (£) - 2 ^ fl _,(|) + 2(« - l)tf B _ 2 (£) - 0 
which may be used to eliminate H n 2 (f ) in equation (D.9), giving 

a2 lj 

+ 2 nH„(g) - „_!(£) = 0 

Application of equation (D.6) again to eliminate H n i(£) yields 

d 2 H n f A 

~ + 2nH „(^) ■= 0 

which is the Hermite differential equation. 


(D.9) 


(D.10) 


Integral relations 

To obtain the orthogonality and normalization relations for the Hermite polynomials, 
we multiply together the generating functions gif, s) and g(f, t), both obtained from 
equation (D.l), and the factor e ^ and then integrate over £ 


/ = 


e -~g(£, .'')£(£, /) d^ — EE 


=0 m =0 


n!m! 




H„(g)H m (g)d£ (D.ll) 
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For convenience, we have abbreviated the integral with the symbol I. To evaluate the 
left integral, we substitute the analytical forms for the generating functions from 
equation (D. 1) to give 

I = e^V^e 2 ^ 2 d£ = Q lst °° e- (| -*~° 2 d(£ — s — t) = jt l/2 e 2st 

J—oo J— oo 

where equation (A. 5) has been used. We next expand e 2st in the power series (A.l) to 
obtain 


/ = 


JT 1 ' 2 


OO 


E 

n =0 


2 ”s n t n 
n\ 


Substitution of this expression for I into equation (D.l 1) gives 


ji 


1,2 E 

n =0 


2 n (st) n 


OO OO 


EE 

n =0 m =0 


s n t m 

n\m\ 


•OO 

e-* 2 H n (g)HJg)d£ 

—OO 


(D.l 2) 


On the left-hand side, we see that there are no terms for which the power of s is not 
equal to the power of t. Therefore, terms on the right-hand side with n / m must 
vanish, giving 

•OO 

e-* 2 #„(£)#„(§) d£ - 0, n + m (D.13) 

— OO 

The Hermite polynomials H„('£,) form an orthogonal set over the range —oo =5 £ =£ oo 
with a weighting factor c ' . If we equate coefficients of (si)" on each side of equation 
(D.l2), we obtain 

•OO 

c~' e [H n (Z)\ 2 d£ = 2"n!jr 1 ( 2 

—OO 

which may be combined with equation (D.13) to give 

•OO 

fT* 1 H n {g)H m (g) d£ - 2"n\jt 1 / 2 d nm (D.14) 

— OO 


Completeness 

If we define the set of functions <p„(^) as 

0»(D = (2"n!r 1 / 2 ^- 1 / 4 e-f 2 / 2 fl B (D (D.15) 

then equation (D.14) shows that the members of this set are orthonormal with 
weighting factor unity. We can also demonstrate 1 that this set is complete. 

We begin with the integral formula (A.8) which, with suitable definitions for the 
parameters, may be written as 


-(* 2 /4)+^ d5 _ 27r l/2 e ^ 


(D.l 6) 


If we replace e in equation (D.3) by the integral in (D. 16), we obtain for H „('£,) 


27T 1 / 2 9 ^ 


poo / 1 \ n 

e HV4)+iSs ds = e l 


2 JT 1 / 2 


poo f)n 

e - 2 / 4 ^L 




(-*)" ^ 
2jt 1 1 2 


*oo 

e -(, 2 /4)+i^ 5 « d5 

—oo 


1 See D. Park (1992) Introduction to the Quantum Theory , 3rd edition (McGraw-Hill, New York), p. 565. 
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The function < p „(£) as defined by equation (D.15) then becomes 


<Pn(£) = ■ 


(—0" 


2(2"jr«!) 1 / 2 jr 1 / 4 

We now evaluate the summation 


,S 2 /2 


-(. 2 /4)+i|, 5 „ ^ 


(D.17) 


£0,(l)0n(r) 


H=0 


by substituting equation (D.17) twice, once with the dummy variable of integration 5 
and once with s replaced by t. Since the functions <j> „('£,) are real, they equal their 
complex conjugates. These substitutions give 




n=0 


1 C ^W°° 

4jt 3 / 2 


00 ( _ 1 \n 

,-[(i 2 +d)/4]+i(^+ro d , 




dsdt 


since (—i) 2 " — (— I)". The summation on the right-hand side is easily evaluated using 
equation (A. 1) 

,(-!)" (si\ 


E 

n =0 


= Q - St ! 2 


Noting that 

we have 

OO 

n=0 


s 2 + t 2 i st _(s+ ty 

4 h 4 


J_ e (^r 2 )/2 r 


4jt 3 / 2 


-[0+0 2 /4]+i(fs+l'0 


(D. 18) 


The double integral may be evaluated by introducing the new variables u and v 


s + t 


s — t 

v = —-— or s — u + v, t — u — v 


dsdt — 2 dudv 

The double integral is thereby factored into 


-« 2 +i(|+|> du 


e i(S t> dv — 2 x ?r 1/,2 e )2/<4 X 2jtd(t; — £') 


where the first integral is evaluated by equation (A.8) and the second by (C.6). 
Equation (D. 18) now becomes 

OO 

5>»(£¥»(£') = e [( « 2+r2)/2] - [(l+ «' )2/4] d(^ - £') - e^ 2 / 4 <5(£ - £') 

«=0 

Applying equation (C.5e), we obtain the completeness relation for the functions ( p„Q ) 

OO 

£0 B (S)0«.(t') «*(£-£') (D.19) 

n=0 

demonstrating, according to equation (3.31), that the set 0„(£) is a complete set. 
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Legendre polynomials 

The Legendre polynomials Pi(/u) may be defined as the coefficients of s 1 in an 
infinite series expansion of a generating function g(ju, s ) 


g(M, s) = (1 - 2 gs + s 2 ) 1/2 = p iiP) s ' 


(E.l) 


1=0 


where —1 =£ u =£ 1 and .v < 1 in order for the infinite series to converge. 
We may also expand g(pi, s ) by applying the standard formula 


m = (i - ^r 1/2 = £ 


n =0 


d”/ 

dz 1 


z=0 


=E 


r" 1 ■ 3 • 5 ■ 


■(2«-l) 


n=0 


i ! 


oo 


E 

n =0 


(2/?)! 

2 2 "(«!) 2 


If we set z = s(2u — s), then g(u, s) becomes 


g(V, s) = 


OO 


E 

n=0 


(2/7)! 

2 2 "(«!) 2 


s n (2fi - s) n 


With the use of the binomial expansion (A.2), the factor (2u — s) n can be further 
expanded as 


(2 [x - s) 


- = V 


(-!)“«! 


a[(n ~ «) ! 


(2 It)" 


so that 


go*> = 

n =0 a=0 


(-1)“(2 n)\fi n ~ a 
2 n+a n\a\(n — a)! 


^»+a 


We next collect all the coefficients of s 1 for some arbitrary l and replace the summation 
over n with a summation over /. Since n + a = l, when n = l, we have a = 0; when 
n — l — 1, we have a — 1; and so on until n — l — M,a — M, where M si l — M or 
M 1/2. The summation over a terminates at a—M, with M — 1/2 for / even and 
M — (l — l)/2 for / odd, because a cannot be greater than n. The result is 
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OO M 




(-\) a (2l-2d)\p 


/-2a 


/=0 a=0 2 a!(/ “ a ) ! ( Z “ 2a ^ ! 

Since the Legendre polynomials are the coefficients of s l in the expansion (E.l) of 
g(p, s ), we have 


M 


P, ( u) = T (-m21-2a)\ , 2a 


(E.2) 


We see from equation (E.2) that P/(u) for even l is a polynomial with only even powers 
of p, while for odd / only odd powers of u are present. 

The first few Legendre polynomials may be readily obtained from equation (E.2) 
and are 


Po(p) - 1 

Flip) = \i5p 3 ~ 3 p) 

p { (p) = p 

P A ip) = p5p 4 - 30 p 2 + 3 ) 

Flip) ¥, - 2 iV ~ 1 ) 

Psip) — \i62p 5 — 70/< 3 + 15 p) 


We observe that P/(I) = 1, which can be shown rigorously by setting p — 1 in 
equation (E.l) and noting that 

OO OO 

g(l, s) = (1 - sy l =J2 S ' = H W)s ' 

1=0 1=0 

Since P/(p) is either even or odd in p, it follows that P/(—1) =P (—1) ? and that 
P/(0) w 0 for / odd. 


Recurrence relations 

We next derive some recurrence relations for the Legendre polynomials. Differentia¬ 
tion of the generating function g(p, s) with respect to 5 gives 

dg p-s ip - s)g 

ds 


(1 — 2 p + s' 2 ) 3 / 2 1—2 p + s 2 


i= l 


lP/(p)s 


i -1 


(E.3) 


The term with / = 0 in the summation vanishes, so that the summation now begins 
with the / = 1 term. We may write equation (E.3) as 


(p - s ) ^ Pi(p)s‘ — (1 — 2ps + 5 2 ) ^ lPi(p)s l 1 


1=0 


i=i 


If we equate coefficients of s 1 1 on each side of the equation, we obtain 

pPi-i(p) - Pi-iip) — IP tip) ~ 2(1 - 1 )pPi-i(ju) + (/ - 2 )P,- 2 (P) 


or 

IPi(p) - (21 - 1 )pP,-i(p) + (7 - 1 )P,- 2 (P) - 0 (E.4) 

The recurrence relation (E.4) is useful for evaluating P/(p) when the two preceding 
polynomials are known. 

Differentiation of the generating function g(p, s) in equation (E.l) with respect to p 
yields 

dg = sg 
dp 1 — 2 ps + s 2 

which may be combined with equation (E.3) to give 
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dg , dg 

s al = (l, ~ s ^ 


so that 


OO OO t P 

Y ipms 1 = («- 5) ^ —-V 

/=i /=o ‘ 

Equating coefficients of s' on each side of this equation yields a second recurrence 
relation 

d Pi dP/_, 

fl ~dp~ ~dp~ ~ P '^ = ° ^ E ' 5 

A third recurrence relation may be obtained by differentiating equation (E.4) to give 

l -j— — (21 — l)g —r—- — (21 — 1 )P,_ l ( M ) + (l- 1)^ = 0 
dp d p dp 

and then eliminating dPi_ 2 /dg by the substitution of equation (E.5) with 1 replaced by 
1 — 1. The result is 


d Pi dP / , 

--—- IPi-i(m) = 0 

dw d« 


(E.6) 


Differential equation 

To find the differential equation satisfied by the polynomials l J i(u), we first multiply 
equation (E.5) by —p and add the result to equation (E.6) to give 

(1 - p 2 )^p- + 1/uPiifi) - lPi-\(ju ) = 0 

d g 

We then differentiate to obtain 


d2 

(1 - g 2 ) ——y — 2g + lfi : f 1 + IPl(tl) - 1 


dP, 

dp 


dP, 

dp 


d P,- 


= 0 


dp 2 ~ r dp ' r dp ' "‘ d/< 

The third and last terms on the left-hand side may be eliminated by means of equation 
(E.5) to give Legendre’s differential equation 


, d 2 Pi d Pi 

(1 — P ) ~ 2 p -j-f 1(1 + 1 )Pi(p) — 0 


(E.7) 


Rodrigues ’formula 

Rodrigues’ formula for the Legendre polynomials may be derived as follows. Consider 
the expression 

v—(p 2 - iy 

The derivative of v is 

^ = 2 lp(p 2 — 1) ,_1 = 2lpv(p 2 — 1) _1 
dg 

which is just the differential equation 

, dt; 

(1 — g 2 )-—|- 21pv — 0 

dg 

If we differentiate this equation, we obtain 
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-k d 2 v ... . dv 

' ] dii 

We now differentiate r times more and obtain 


(1 — [A ) ——j H- 2(7 — 1)/^ ——(-2 Iv — 0 
z d/u 


7 d r+2 v d r+x v d r v 

" ->‘' ) A^- +2[1 - r -' ) ' , 4^ + {r+ r) d^ = ° 


(E.8) 


If we let r = / and define w as 


d l v d 1 2 n i 
w = — = —(p - 1) 


d/< 7 d/i 7 


then equation (E.8) reduces to 


d 2 w 


dw 


(l-p z )—~2p — +l(l+l)w=0 
dp- dp 

which is just Legendre’s differential equation (E.7). Since the polynomials Pi(p) 
represent all of the solutions of equation (E.7), these polynomials must be multiples of 
w, so that 

d*_ 

dp 1 

The proportionality constants c/ may be evaluated by setting the term in p 1 , namely 

d 7 

Cl 


Pi(u) = Cl— j(p 2 - l) 1 


21 ( 2 /)! / 
v* = c/ ^ 


equal to the term in p 1 in equation (E.2), i.e., 

( 2 /)! 


2'(/!) : 


u 


1 


Thus, we have 

C/ = 2 >n 

J_d ? 

Vd.dp 

This expression (equation (E.9)) is Rodrigues’ formula. 


and 




(E.9) 


Associated Legendre polynomials 

The associated Legendre polynomials P’”(p) are defined in terms of the Legendre 
polynomials Pi(p) by 

p?(p) = (i - p 2 y n/2 d ^rr ) (E-10) 

where m is a positive integer, m — 0, 1, 2,..., /. If m = 0, then the corresponding 
associated Legendre polynomial is just the Legendre polynomial of degree I. If m > l, 
then the corresponding associated Legendre polynomial vanishes. 

The generating functions g i m) (p, s) for the associated Legendre polynomials may 
be found from equation (E.l) by letting 
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Legendre and associated Legendre polynomials 

g (n %t, ,^(.-, 2 r /2d ^ 

d 8 ^ S) = 3-5. (2m - l> m (l - lus + 5 2 )- (m+ 5 ) 


(2m)\ 
2 '"/«! 


5 m (l - 2fis + 5 2 r (m +j) 


we have 


g im) (lL s) = Y j P?( f i)s , = 


(2m)\(l - p 2 ) m / 2 s‘ 


(E. 11) 


i=m 2 m m\(l-2/US +s 2 ) m+ i 

We can also write an explicit series for Pf(fi) by differentiating equation (E.2) m 
times 

pm(l , n ,2ynl2^ (-!)“(2/-2«)! /< / -'"- 2 “ 

p,oo- (i -,o }_ Jo2lal(l _ aW _ m _ 2a)l (E.12) 

where M' — (l — m)/2 or (/ — m — l)/2, whichever is an integer. Furthermore, 
combining equation (E.10) with Rodrigues’ formula (E.9), we see that 

1 d l+m 

W = jTfld - - 0' (E.13) 

The first few associated Legendre polynomials are 

Pl(p) = P 0 (m) = 1 

^?Cm) = p 

p\(p) — (i — / <2 ) 1/2 

^ 2 (/<) - Piip) = |(3/r 2 - 1) 

.PjOm) = 3/fiT - /< 2 ) 1/2 
P 2 2 (p) = 3(1-fi 2 ) 


Differential equation 

The differential equation satisfied by the polynomials P'"(u) may be obtained as 
follows. Let r — l + m in equation (E.8) and define w m as 


w,„ = 


Al+m d m P, 

a , i t / „ a n 


d/r 


l-\-m 


(ft 2 - 1)' = 2 l l\ 


dp" 


so that 


w m = 2 / /!(l - ft 2 ) ml2 P]'(p) 


(E-14) 
(E. 15) 


Equation (E.8) then becomes 

d 2 w, 


dw, 


(1 - n 2 )—-f - 2 (m + 1 )p~r 1 + W + 1) - m(m + l)]w m - 0 
dfi- dp 


(E.16) 
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We then substitute equation (E. 15) for w m and take the first and second derivatives as 
indicated to obtain 


( 1-/0 


2 ,d 2 P”i 0 d Pf 


dp 2 


— 2p 


dp 


+ 


/(/ + 1) - 


1 — p 2 


P?(P) = o 


(E.17) 


Equation (E.17) is the associated Legendre differential equation. 

Orthogonality 

Equation (E.17) as satisfied by P'"(p) and by P™(p) may be written as 


and 


_d_ 

dp 

_d_ 

dp 


(1 - p 2 ) 


d PI 


( 1 - 0 ) 


dp 

d Pf 


dp 


+ 


+ 


1(1 + 1 ) - 


1 — p 2 


/'(/' + !)- 


1 — p 2 


P?(P) = o 


p?0*) = 0 


If we multiply the first by P"!(ju) and the second by Pf(p) and then subtract, we have 


pm _ 

v dp 


A pm 


_ pm _ 

1 dp 


( 1 - 0 ) 


d P m 
2 ar r 


dp 


= [/'(/' + l)-/(/+l)]P^r 


We then add to and subtract from the left-hand side the term 


d P m d P m 

(1 -p 2 ) 1 l ' 


so as to obtain 
d 

dp 


(1-0) [P'r 


, dp 'i 
dp 


- P", 


, d p';; 

dp 


dp dp 


[/'(/' + l)-l(l+l)]P^PJ! 


We next integrate with respect to u from — 1 to +1 and note that 


d P m 

2x/ D 


(i-O) [P" 


dp 


-PI 


, dp r 

dp 


- 0 


giving 

[/'(/' + 1)- /(/+ 1)] 
If l' / /, then the integral must vanish 


P n ;P'\) dp = 0 




(E. 18) 


so that the associated Legendre polynomials P"'(p) with fixed in form an orthogonal 
set of functions. Since equation (E.18) is valid for m — 0, the Legendre polynomials 
P/(p) are also an orthogonal set. 


Normalization 

We next wish to evaluate the integral //„ 

Ilm = 


[P"i(p)fdp 


As a first step, we evaluate 7/o 
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d 


I/O — 


[Pi(,U)Y d [i 


-i 


We solve the recurrence relation (E.4) for P/(p), multiply both sides by P/(p), integrate 
with respect to fi from —1 to +1, and note that one of the integrals vanishes according 
to the orthogonality relation (E. 18), so that 


pP t (p)Pi-x(p)dp 


[Pi(p)] 1 dp = 2I —- 

-1 L 

Replacing / by / + 1 in equation (E.4), we can substitute for /) on the right-hand 
side. Again applying equation (E.18), we find that 


-l 


[Pi(p)f dp 


21 - 1 
21 + 1 


[Pi-i(p)f dp 


-1 


This relationship can then be applied successively to obtain 


[P,(p)f dp 


(2/ — 1)(2/ — 3) 
( 2 /+ 1 )( 2 /- 1 ) 


[Pi-iip)] 2 dp 


-l 


(21- 1 )(2/ — 3) • • • 1 
(21 + 1)(2/ — 1 )(2/ — 3) • • • 3 


[ P ()(/0l 2 dp 


1 


21 + 1 , 

Since Pq( 1) = 1, the desired result is 
d 


[P 0 (p)f dp 


1 


,[/>,»] 2i+1 


d 9 

dp — 

l 


21 + 1 

We are now ready to evaluate From equation (E.10) we have 


1 tm — 


( 1-/0 


2, m ld m Pi' 2 fl 


-1 \d P 

Integration by parts gives 

,d m P, d n, - l P,' 1 


dp 


(l 

i dp m dp \ dp” 


, m— 1 


2 \m ’ 


f/w — (1 /^ ) 


dp m dp" 


d m ~ l Pi d 

! dp m ~ l dp 


(1-0) 


2\m d m Pj 


dp" 


(E.19) 


dp 


dp (E.20) 


The integrated part vanishes because (1 — p 2 ) — 0 at p — ±1. 

To evaluate the integral on the right-hand side of equation (E.20), we replace m by 
m — 1 in (E.16) and multiply by (1 — p 2 )"‘ 1 to obtain 

,,2\m d 2 W m -l ,,2\m— 1 dw w —1 

(1 p ) 2 2mp(\ p ) 


d,w 2 


dp 


+ [/(/ + 1 ) - m(m - 1)](1 - p 2 ) m ~ l w m -1 = 0 


which can be rewritten as 
_d_ 
dp 


( 1_ 2 )m dw™-i 
dp 


— —(I + m)(l — m + 1)(1 — p ) m Wji-i = 0 


From equation (E.14) we see that 
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Wm ~ l ~ 2 ' d/h"- 1 


dw m _i / d m P/ 

= = 2 /! - 


d/i 


dp" 


_d_ 

d/i 


(l-/i 2 )' 


, d m P, 


d/i" 


= —(/ + ni){l — m + 1)(1 — /r) 


2\m— 


Thus, equation (E.20) takes the form 

hm — (/ + w)(/ - m + 1) 


( 1-/0 


2 \m— 1 


Using equation (E.10) to introduce P" 1 1 (/i), we have 


w—1 


77 


d/i 


m— 1 


, d™ 1 P/ 
d/i”' -1 


d/i 


Ilm — V + «*)(/ - m + 1 ) 


[P?~ l (M)Y d/i 


= (/ + m)(l - m + l)//, m -i 

which relates //,„ to //_,,, ]. This process can be repeated until 7/ 0 is obtained 
hm — [(/ + »?)(/ + m ~ 1)][(7 “ m + 1)(( - m + 2)]7/, m _2 


so that 


- [(/ + w)(/ + in - 1) • • • (/ + 1)][(7 - m + 1)(/ 
_ (/ + m)\ /! 

“ 71 (/- my. 


m + 2) • • • /]/, 0 


[ J pr(/i)] 2 d/i 

-i 


2(/ + wz)! 

(2/ + !)(/ — mj! 


Completeness 

The set of associated Legendre polynomials PJ'(u) with m fixed and / = m, 
m + 1, ..., form a complete orthogonal set 1 in the range — 1 =£ /i «£ 1. Thus, an 
arbitrary function /(/i) can be expanded in the series 

OO 

/(/i)-^a /mJ Pf(/i) 

/= m 

with the expansion coefficients given by 


1 The proof of completeness may be found in W. Kaplan (1991) Advanced Calculus, 4th edition (Addison- 
Wesley, Reading, MA) p. 537 and in G. Birkhoff and G.-C. Rota (1989) Ordinary Differential Equations, 
4th edition (John Wiley & Sons, New York), pp. 350-4. 
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&lm 


2 (l + 77?)! 

The completeness relation for the polynomials P"'(fi) is 
* 21 + 1(1 — /??)! 


P'i(u)f(lt)d,u 


E : 


(/ + m)\ 


P7( M )p?(p') = d( M - f i') 
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Laguerre and associated Laguerre polynomials 


Laguerre polynomials 

The Laguerre polynomials L^(p) are defined by means of the generating 
function g(p, s) 


e -ps/p-s) X s k 

g(p, S)= =^L k{p) 

k=0 


(F-1) 


where 0 =£ p *£ oo and where .v < 1 in order to ensure convergence of the infinite 
series. Since the right-hand term is a Taylor series expansion of g(p, s), the Laguerre 
polynomials are given by 

d k g(p, s) __ 

ds k 


Lk(p) 


ds k 


d k 


1 -5 


(F.2) 


To evaluate Lk(p) from equation (F.2), we first factor out d' in the generating 
function and expand the remaining exponential function in a Taylor series 


gild s) = 


1 - 5 


-p/O-i) 




We then take k successive derivatives of g(p , s ) with respect to s 

n \ 


< 2=0 


^p> = „f : tlld ia+1)ia+m - sr ^ 


a =0 


d k g(p, s) _ „^ (-i y P a (a + k)\ e , „\-(a+i+U 

ds k “ At a! a! ( 


a =0 


When the Mi derivative is evaluated at ,v = 0, we have 

hip) ~- 


^d{-X) a (a + k)\ 

/ y P 

< 2=0 


( a !) 2 


(F.3) 
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Using equation (A.l) we note that 


— (p k Q - p ) = — V ( ~ 1)a p a+k 


^(-\) a (a+k)\ 

/ / /~l\? ^ 

a=0 


- ' d a\ " ^ (a!) 2 

Combining equations (F.3) and (F.4), we obtain the formula for the Laguerre 
polynomials 

L k {p) = ^ >k (p k e p ) 

Another relationship for the polynomials L k (p) can be obtained by expanding the 
generating function g(p, s) in equation (F.l) using (A.l) 


(F.4) 


(F.5) 


g(P, s) = ■ 


-ps/{\ — s) OO 


1 -5 


E 

a=0 


(-i) a p° 


a! (1 - s) a+1 


The factor (1 - s) ,a M) may be expanded in an infinite series using equation (A.3) to 
obtain 


(1 - s) 


-(a+l) = \^ ( a + P)' „0 


E' 

J3=0 


a\ft\ 


so that g(p, s) becomes 


' (-1 na+p)\p a a+fS 

( a\) 2 /3\ S 


g( P , u-zi:- 

a =0 0=0 

We next collect all the coefficients of s k for an arbitrary k, so that a + ft k, and 
replace the summation over a by a summation over k. When a—k, the index ft equals 
zero; when a = k — 1, the index ft equals one; and so on until we have a = 0 and 
ft — k. Thus, the result of the summation is 

(_i y-Ppk-P k 


g(p, *)=*!££—— 


W/M> M-P)W 

Since the Laguerre polynomial L k (p) divided by k\ is the coefficient of s k in the 
expansion (F.l) of the generating function, we have 

_ k /-_i \k —0 

L k (p) = (k\f'£ u [J D ^ m P k - p 


0=0 


[(k-ftWftr 


If we let k — ft — y and replace the summation over ft by a summation over y, we 
obtain the desired result 

(-IF _ v 




(F.6) 


A third relationship for the polynomials L k (p) can be obtained by expanding the 
derivative in equation (F.5), using (A.4), to give 


hip) 


'E; 


k\ 


a Ak—a c 


d a p K d 


E 


(-l) k ~ a k\d a p 


a ~k 


a=( j a\(k - a)! dp a &p k ~ a ^ al(k - a)! dp“ 

We now observe that the operator [(d/dp) — \] k may be expanded according to the 
binomial theorem (A.2) as 
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[(d/d p) - 1]* 


so that 


(-1) A '[1 - (d/dp)]^ 


k 


<->>*£ 

a=0 


(-If 7! d“ 
a\(k — a)! d p a 


L k {p ) - [(d/d p) - 1] V (F.7) 

where we have noted that (-1)“ == (- I) a . 

From equation (F.2), (F.5), (F.6), or (F.7), we observe that the polynomial L k (p) is of 
degree k and we may readily obtain the first few polynomials of the set 

U)(p) = 1 

L\ (p) - 1 - p 

L 2 (p) = 2-4 p + p 2 

7-3 (p) = 6 - 18p + 9p 2 - p 3 


We also note that Z-a-(O) = k\. 


Differential equation 

Equation (F.5) can be used to find the differential equation satisfied by the polynomials 
L k (p). We note that the function f(p) defined as 

fip) = P k e- p 

satisfies the relation 

/>w+(P - k)f = 0 
dp 

If we differentiate this expression k + 1 times, we obtain 

where f [k) is the 7th derivative of f(p). Since from equation (F.5) we have 

■= e~ p L k (p) 

the Laguerre polynomials L k (p) satisfy the differential equation 

P , 2 + (1 ~ P) ~r~ + kLk(p) = 0 (F.8) 

dp z dp 


Associated Laguerre polynomials 

The associated Laguerre polynomials L J k (p) are defined in terms of the Laguerre 
polynomials by 

L J k (p) = ^L k (p) (F.9) 

Since L k (p) is a polynomial of degree k, L k k (p ) is a constant and L'fp) — 0 for j > k. 
The generating function g(p, s; j ) for the associated Laguerre polynomials with fixed 
j is readily obtained by differentiation of equation (F. 1) 
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d 7 (-l)V 


dp' V 1—5 / (1 — ,s)- /+l 


g(p ’ s; j) = dpi {-T^r) = (T^r e ' ps/(1 “ 4 = g ^ *i (R10) 

The summation in the right-hand term begins with k — j, since j cannot exceed k. 

We can write an explicit series for L J k (p) by substituting equation (F.6) into (F.9) 

jj, (~l) y d 7 y,_ (~1F _ y —j 

k l ) ( ’ g (y!) 2 (* - /)! dpj P ( 0 g y!(* - y)!(y - /)! P 

The summation over y now begins with the term y — j because the earlier terms 
vanish in the differentiation. If we let y — j — a and replace the summation over y by 
a summation over a, we have 


dw=(*yi: <-ry. + v 

^ a! (^ - J - a)\(j + a)! 


(F-11) 


For the purpose of deriving some useful relationships involving the polynomials 
L J Ap), we define the polynomial A \(p) as 


A ^ P) ~ a +/)! L ’ +j(p) 

If we replace the dummy index of summation k in equation (F.10) by i, where 
i — k — j, then (F.10) takes the form 


(F-12) 




Ll j+j(Pi t+j = j y- a Kp) i 

(i + jy. ' U d 


i =o v 1=0 

Thus, A j(p) are just the coefficients in a Taylor series expansion of the function 
s~ j g(p, 5 ; j) and are, therefore, given by 


A /(p) = tt-5 j g(p, s; j) 


Substituting for g(p, 5 ; j) using equation (F.10), we obtain 

£)' e -p«/(i-0 

A 7 (p) = (-1V Try 71 I\7TT 


I'E 

e -ps/(!-i) 

cA' 

O 

II 

>1 

1 

+ 

1 

»'E 

CD 

CD 

1 

T 

ds‘ 

0 

jl 

1 

+ 

■’-s 

1 


L»v ( ~ p) “ 

^ a!(l - s) a +J+ l 

cA ! 


= (-!> 




i=o a! (a + ^- 


(F.13) 


We next note that 
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d ! 

dp' 


(P'^e-O 


d y^ ( l) »a+ j+i _ y^ (~1)° («+7 + 0! 

p a +j 

a! a! {a + iV 


• ^ (a+ _/'+/)! 


(F ■ 14) 


Comparison of equations (F. 13) and (F.14) yields the result that 

A i(p) - (- l) y P y e p (p I+J eT p ) 

From equation (F.12) we obtain 

. (i /V d z 

zy/p) = (-ly^y^p v>—(p'^e o 

Finally, replacing z by the original index k(— i + j ), we have 

W = < F - 15 ) 

Equation (F.15) for the associated Laguerre polynomials is the analog of (F.5) for the 
Laguerre polynomials and, in fact, when j = 0, equation (F.15) reduces to (F.5). 


Differential equation 

The differential equation satisfied by the associated Laguerre polynomials may be 
obtained by repeatedly differentiating equations (F.8) j times 

d 3 L k d 2 L k d Li 

^ + < 2 -V + d-i)y=° 


dp 4 


dp 3 


dp 2 


d y+2 4,,. 11 ,d j+l L k , „ .,d^,_ A 

P j : ,-+9 + 0 + 1 - P) ,, +{k- j) —— - 0 


dp /+2 


dp ,+1 


dp 7 


When the polynomials Lj(p) are introduced with equation (F.9), the differential 
equation is 


1 —-Jr + 0 + 1 - P) + (k - j)L' k (p) — 0 


dp 2 


(F-16) 


Integral relations 

In order to obtain the orthogonality and normalization relations of the associate 
Laguerre polynomials, we make use of the generating function (F. 10). We multiply 
together g(p, 5 ; j), g(p, t; j ), and the factor p J+v eT p and then integrate over p to give 
an integral that we abbreviate with the symbol I 


7 = 


00 00 f/i 


pJ+ v e P g(p, s ; j)g(p, t; j) dp « EEs 


a=j (5=j 


1 a\f\ 


J+ v p 


5 7»7 2 (p)dp 


(P-17) 
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To evaluate the left-hand integral, we substitute the analytical forms of the generating 
functions from equation (F.10) to give 

(sty 


i = 


(i - sy +1 (i - ty +1 


where 


a = 1 -F ■ 


- + - 


p j+ V Q-°P ^ 


1 —st 


i - 5 ' i - 1 (i - y>(i - 1 ) 

The integral in equation (F. 18) is just the gamma function (A.26), so that 


p j+ V Q- a P ^ = 


ro' + v+i) (i + v)\ 


j + v > 0 


1 = 


(F -18) 


j 0 aJ +v+1 aJ+ v + 1 

where we have restricted v to integer values. Thus, I in equation (F. 18) is 

(/• + v)W(i -yrq - ty 

(1 — st)i +v+x 
Applying the expansion formula (A.3), we have 

( 1 _ styU^+t) = y^ (/+v+Q! • 

( ° h 0-+v)!,1 ™ 

If we replace the dummy index i by a, where a — i + j , then this expression becomes 


(1 - st) - (j+v+]) = 


(a + v)\ 


a=J 


■ O' + v)\(a - j)\ 


(st) 


p-J 


and / takes the form 


Combining this result with equation (F. 17), we have 

OO oo ^ s a fP 


a=j p=j 


(a + v)\ 


EE — \ p j+v c PL^L^dp = (1 - s) v (l - p'Yttctv^" < F19 ) 


a=J 


(a-/)! 


We now equate coefficients of like powers of s and t on each side of this equation. 
Since the integer v appears as an exponent of both 5 and t on the right-hand side, the 
effect of equating coefficients depends on the value of v. Accordingly, we shall first 
have to select a value for v. 

For v — 0, equation (F.19) becomes 


EE 

a=j p=j 


S a t fS 

a\pi 


p J e p L J a (p)L J p (p)dp 


^ a \ 

, (« -./)! 


(stf 


a=j 


Since the exponent of s on the right-hand side is always the same as the exponent of t, 
the coefficients of s a t/ 1 for a / (> on the left-hand side must vanish, i.e. 

^ p J c~ p L J Jp) Li(p) dp = 0; a±p (F.20) 

Jo 

Thus, the associated Laguerre polynomials form an orthogonal set over the range 
0 =£ p -x, with a weighting factor p'c p . For the case where s and t on the left-hand 
side have the same exponent, we pick out the term p — am the summation over (>, 
giving 
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E 

a=j 


(St) a 

( a !) 2 




a=y 


Equating coefficients of (■??)“ on each side yields 

f 00 „ ta'l 3 

Equations (F.20) and (F.21) may be combined into a single expression 

(a!) 3 


p J s p LJ a (p)L J p (p)dp = ^—^-6 


a/3 


(F.21) 


(F.22) 


For v = 1, equation (F.19) becomes 


00 00 s a t@ f0 ° 


„ P ' +le PL/ a(P) L fl(P)*P 


a=j 0=j 


J2 (a + l) l ! [( 5 0 a + (^) a+1 - * a+1 ^ i a+1 ] 


«=/ 


• (a -./)! 


Equating coefficients of like powers of 5 and t on both sides of this equation, we see 
that 


p J+l e p L J a (p)LUp) dp — 0; ft ^ a, a ± 1 


and that 


a![(a+ l )!] 2 


p i+l e p [L J a (p)] 2 dp — (a!) : 


P J 'e p L J (p)L J ftp) dp = - 
o (a-;)! 

a! 


(a + 1)! 


+ 


L(a-y)! (a-l-y)U 


(2 a-y+l)(a!) 3 
(a - /)! 


(F.23) 


(F.24) 


(F.25) 


The term in which ft — a — 1 is equivalent to the term in which ft — a + 1 after the 
dummy indices a and ft are interchanged. Equations (F.23), (F.24), and (F.25) are 
pertinent to the wave functions for the hydrogen atom. 


Completeness 

We define the set of functions Xkjip) by the relation 

row)! i 1 / 2 


Xkjip) 


(*!> 


3 P 2 e p 


4(p) 


(F-26) 


According to equation (F.22), the functions Xkjip) constitute an orthonormal set. We 
now show 1 that this set is complete. 

Substitution of equation (F.15) into (F.26) gives 

1 1//2 d k ~ J 

[k\(k — j)\ 


Xkj(p) = (-iy 


d p k ~J 


■(P k e~ p ) 


(F.27) 


If we apply equation (A. 11), we may express the derivative in (F.27) as 


1 D. Park, personal communication. This method parallels the procedure used to demonstrate the complete¬ 
ness of the set of functions in equation (D. 15). 
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- « ill 




dp k -J 2jrdp k -J 

so that Xkj(p) in integral form is 
(—iy'i k ~j 


Xkjip ) 


2jz 


(l+is) k + 1 
k\p J & 


ds 


i‘ 1 

2 71 


^1 ps 


-iaP~\ */ 2 


Vik-m 


dps 


(1 + is) 


k +1 


(1 + is) i+1 

s k ~ J ds 


i k - j ds 


(F.28) 


To demonstrate that the set Xkj(p) is complete, we need to evaluate the sum 


^2xkj(p)Xkj(p') 

k=j 

Expressing (E28) in terms of the dummy variable of integration ,s' for p and in terms of 
t for p', we obtain for the summation 

OO 

'}^Xkj(P)Xkj(p') 

k=j 


= 4^ {PPT 


j/ 2 e (p+p')/2 

•OO 

*00 

e i (ps+p't) 

"y'(-l) 7 J kl ( st) k -J 

- 

—OO- 

—OO 

fcj (k ~ ./)! [l+i(5+/)-st]^ 


dsdt 


By letting a — k — j, we may express the sum on the right-hand side as 

V ( ~ 1)tt( “ +7)! [1 + i(s + t) - s/ ]- ( «+/+l ) ( rf )« = y![l + i(5 + 0] _O+1) 

z— j a i 


a=0 


where we have applied equation (A.3) to evaluate the sum over a. We now have 


°° /1 

^Xkj(p)x,(p') = (ppT j/ 2 e (p+p ' )/2 


k=j 


J(ps+p't) 


[1 + i(s + t)]j+ 


J dsdt (F.29) 


To evaluate the double integral, we introduce the variables u and v 
s + t 


u = 


s — t 

v — —— or s — u + V, t — u — v 


ds dt = 2 dudv 


The double integral then factors into 


J(p+p')u 


dii 


Jip-p')v dv = 2 ^- {^y~) eHp+p )/2 ^ 2jld (P - p')] 


J_ oc (l + 2i u)J+ l 

where the first integral is evaluated by equation (A.ll) and the second by (C.6). 
Equation (F.29) becomes 

P + P' v 


^Xkj(P)Xkj(p') 

k=j 


d(p- p') 


[2ipp'y/ 2 \ 

By applying equation (C.5e), we obtain the completeness relation 

OO 

^Xkj(P)Xkj(p') = dip - p') 
k=j 

demonstrating according to equation (3.31) that the sct^^(p) is complete. 


(F.30) 
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Series solutions of differential equations 


General procedure 


The application of the time-independent Schrodinger equation to a system of chemical 
interest requires the solution of a linear second-order homogeneous differential equa¬ 
tion of the general form 


d 2 u(x) du(x) 

p(x) , 9 + q(x) —— + r(x)u(x) = 0 


(G.l) 


dx 2 ' dx 

where p{x), q(x), and r(x) are polynomials in x and where p(x) does not vanish in 
some interval which contains the point x = 0. Equation (G.l) is linear because each 
term contains u or a derivative of u to the first power only. The order of the highest 
derivative determines that equation (G.l) is second-order. In a homogeneous differ¬ 
ential equation, every term contains u or one of its derivatives. 

The Frobenius or series solution method for solving equation (G.l) assumes that the 
solution may be expressed as a power series in x 


OO 

ii = a k x k+s — OqX s + ai.x i+1 + • • • 
k=0 


(G.2) 


where a k (k — 0, 1,2....) and s are constants to be determined. The constant s is 
chosen such that ao is not equal to zero. The first and second derivatives of u are then 
given by 

A., oo 

— — ii’ — ^ a k {k + s)x k+s ~ l — a 0 sx s ~ l + ni(5 + l)x* + • • • (G.3) 

^ k^o 

d 2 u 00 

—== u" — ^ a k (k + s)(k + s — \)x k+s ~ 2 — a Q s(s - \)x s ~ 2 + a\(s + l)sx^' + • • • 
^ k^o 


(G.4) 

A second-order differential equation has two solutions of the form of equation (G.2), 
each with a different set of values for the constant 5 and the coefficients a k . 

Not all differential equations of the general form (G.l) possess solutions which can 
be expressed as a power series (equation (G.2)). 1 However, the differential equations 
encountered in quantum mechanics can be treated in this manner. Moreover, the power 


1 For a thorough treatment see F. B. Hildebrand (1949) Advanced Calculus for Engineers (Prentice-Hall, 
New York). 
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series expansion of u is valid for many differential equations in which p(x), q(x), and/ 

or r(x) are functions other than polynomials, 2 but such differential equations do not 

occur in quantum-mechanical applications. 

The Frobenius procedure consists of the following steps. 

1. Equations (G.2), (G.3), and (G.4) are substituted into the differential equation (G.l) 
to obtain a series of the form 

OO 

a k [{k + s)(k + s — 1 )p(x)x k+s ~ 2 + (k + s)q(x)x k+s ~ l + r(x)x k 1 '] — 0 

k =0 

2. The terms are arranged in order of ascending powers of x to obtain 

OO 

]T c k x k+s ~ 2 = 0 (G.5) 

k=a 

where the coefficients c k are combinations of the constant ,v, the coefficients a k , and 
the coefficients in the polynomials p(x), q(x), and r(x). The lower limit a of the 
summation is selected such that the coefficients c k for k < a are identically zero, 
but c a is not. 

3. Since the right-hand side of equation (G.5) is zero, the left-hand side must also 
equal zero for all values of x in an interval that includes x = 0. The only way to 
meet this condition is to set each of the coefficients c k equal to zero, i.e., c k — 0 for 
k = a, a + 1,... 

4. The coefficient c a of the lowest power of x in equation (G.5) always has the form 
c a — «o/G), where f(s) is quadratic in 5 because the differential equation is 
second-order. The expression c a = a 0 f (s) — 0 is called the indicial equation and 
has two roots, s\ and s 2 , assuming that a 0 f 0. 

5. For each of the two values of s, the remaining expressions c k — 0 for k — a + 1, 
a + 2, ... determine successively a\, a 2 , ... in terms of Go- Each value of 5 yields 
a different set of values for ap one set is denoted here as a k , the other as a' k . 

6. The two mathematical solutions of the differential equation are u\ and u 2 

u\ — a Q x s '[\ + (ai/a 0 )x + (a 2 /ao)x 2 + • • •] 

u 2 = a' 0 x S2 \l + (a[/a' 0 )x + (a 2 /a' 0 )x 2 + • • •] 

where a 0 and a' 0 are arbitrary constants. Physical solutions are obtained by applying 
boundary and normalization conditions to u\ and u 2 . 

7. For some differential equations, the two roots si and s 2 of the indicial equation 
differ by an integer. Under this circumstance, there are two possible outcomes: (a) 
steps 1 to 6 lead to two independent solutions, or (b) for the larger root ,V|, steps 1 
to 6 give a solution u\ , but for the root s 2 the recursion relation gives infinite values 
for the coefficients a k beyond some specific value of k and therefore these steps fail 
to provide a second solution. For some other differential equations, the two roots of 

2 See for example E. T. Whittaker and G. N. Watson (1927) A Course of Modern Analysis, 4th edition 
(Cambridge University Press, Cambridge), pp. 194-8; see also the reference in footnote 1 of this 
Appendix. 
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the indicial equation are the same (h i = s 2 ) and therefore only one solution u\ is 
obtained. In those cases where steps 1 to 6 give only one solution u\, a second 
solution ui may be obtained 3 by a slightly more complex procedure. This second 
solution has the form 

OO 

U 2 ~ cu\ In x + c b k x k+Sl 
k= 0 


where c is an arbitrary constant and the coefficients b k are related to the coefficients 
(p. However, a solution containing lnx is not well-behaved and the arbitrary 
constant c is set equal to zero in quantum-mechanical applications. 

8. The interval of convergence for each of the series solutions U\ and u 2 may be 
determined by applying the ratio test. For convergence, the condition 


lim 

> OO 


a k 


IjcI < 1 


must be satisfied. Thus, a series converges for values of x in the range 

where R is defined by 


1 1 

-< x < — 

R R 


R = lim 

k—* oo 


a k +i 

a k 


For R equal to zero, the corresponding series converges for —oo < x < oo. If R 
equals unity, the corresponding series converges for — 1 < x < 1. 


Applications 

In Chapters 4, 5, and 6 the Schrodinger equation is applied to three systems: the 
harmonic oscillator, the orbital angular momentum, and the hydrogen atom, respec¬ 
tively. The ladder operator technique is used in each case to solve the resulting 
differential equation. We present here the solutions of these differential equations 
using the Frobenius method. 


Harmonic oscillator 

The Schrodinger equation for the linear harmonic oscillator leads to the differential 
equation (4.17) 


d 2 0(£) 

df- 






(G.6) 


If we define 2 by the relation 
and introduce this expression into equation (G.6), we obtain 


7 F 

22 + 1 = — 
na> 


(G.7) 


cp" + (22 + 1 - £ 2 )0 = 0 


(G.8) 


3 See footnote 1 of this Appendix. 
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We first investigate the asymptotic behavior of 0(£). For large values of f, the 
constant 21 + 1 may be neglected in comparison with f 2 and equation (G.8) becomes 

<P"= £ 2 0 

The approximate solutions of this differential equation are 

0 = ce^ 2 / 2 

because we have 

(j>" — (£ 2 ± 1)0 « £ 2 0 for large £ 

The function ± 2/2 is not a satisfactory solution because it becomes infinite as 
£ —» ±oo, but the function c 2-/2 is well-behaved. This asymptotic behavior of 0(£) 
suggests that a satisfactory solution of equation (G.8) has the form 

m = »(£)e-« 2 / 2 (G.9) 

where u{f) is a function to be determined. 

Substitution of equation (G.9) into (G.8) gives 

u" — 2£w' + 2Xu = 0 (G.10) 

We solve this differential equation by the series solution method. Applying equations 
(G.2), (G.3), and (G.4), we obtain 

OO OO 

Y a k (k ± s)(k ± s - 1)^+*- 2 + Y a k [-2(k ± s) ± 21]^ +i = 0 (G.l 1) 
k =0 k =0 

The coefficient of f s 2 gives the indicial equation 

a 0 s(s - 1) = 0 (G.l2) 

with two solutions, s — 0 and 5=1. The coefficient of 'f s 1 gives 

m(5± l)s = 0 (G.l3) 

For the case 5 = 0, the coefficient a\ has an arbitrary value; for 5 = 1, we have a\ — 0. 

If we omit the first two terms (they vanish according to equations (G.12) and 
(G. 13)) in the first summation on the left-hand side of (G.l 1) and replace the dummy 
index k by 1 + 2 in that summation, we obtain 

OO 

Y{ a k+i( k + * + 2 ){k + 5 + 1) + a k [-2{k + 5) + 2XM k+s = 0 (G.14) 

k =0 


Setting the coefficient of each power of c equal to zero gives the recursion formula 

2(k ±5 — 1) 


a k +2 


-a k 


(G.l 5) 


(k ± 5 ± 2 )(k ± 5 ± 1) 

For the case 5 = 0, the constants ao and a\ are arbitrary and we have the following 
two sets of expansion constants 
a o a i 

2(1 - 1 ) 


Q2 — —lr/() 


C?3 = 


3! 


a i 


«4 = - 


«6 =' 


2(2 - 1 ) 
4-3 
2(4 -1) 
6-5 


-a 2 — -- 


2 2 1(2 - 1 ) 


4! 


-a o 


- 4 = — - 


2 3 1(2 —1)(4 — 1) 
6 ! 


as 


- Uq Clj 


2(3-1) 2 2 (1 —1)(3 — 1) 


5-4 


- «3 = - 


5! 


-a\ 


2(5-1) 2 3 (1 —1)(3 —1)(5 — 1) 


7-6 


-a 5 


7! 


a i 


Thus, the two solutions of the second-order differential equation (G.10) are 
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U\ — ClQ 


l-Xf 


2 2 X{2 - X) fr4 2 3 2(2 - X)(4 - X) 
4! * 6! * 


(G.16a) 


ii 2 = a i 


2(1-1) 3 2 2 (1 — A)(3 - A) s 2 3 (1 —1)(3 — A)(5 — 1) 7 

s -I- 3 ! s -i- 5 , 5 + 7 , S t 


(G.16b) 

The solution u\ is an even function of the variable £ and u 2 is an odd function of £. 
Accordingly, mi and u 2 are independent solutions. For the case s — 1, we again obtain 
the solution u 2 . 

The ratio of consecutive terms in either series solution u\ or u 2 is given by the 
recursion formula with s — 0 as 


ak+ih 


k+2 


In the limit as k 


2(k - X) 

a k £ k ~{k + 2)(k+\)' 

oo, this ratio approaches zero 

Cl k +2^ k+1 

hm - —r — 

£-*oo a k g K 


2 , 
. hm t £ 

k —>oo /C 


o 


so that the series u\ and u 2 converge for all finite values of To see what happens to 
Mi and u 2 as £ —> ±oc, we consider the Taylor series expansion of e c 

OO tin i4 tb 


J? - 


SV= 1+ « 2 + 21+3! + 


The coefficient a n is given by a n = l/(«/2)! for n even and a n = 0 for n odd, so that 


^■n+ 2^ 


n +2 


- 1 ! 


m„£' ! 


n + 2 


-£ 2 = 


: + 1 


>2 2^ 2 

£ ~- as 77 

77 


OO 


Thus, Mi and m 2 behave like e^~ as £ —> ±oo. For large |£|, the function 0(£) behaves 
like 


0(£) = M(^)e ^ 2,/2 w e^e ^ 2 = e^"/ 2 —> oo as £ —> ±oo 

which is not satisfactory behavior for a wave function. 

In order to obtain well-behaved solutions for the differential equation (G.8), we need 
to terminate the infinite power series mi and u 2 in (G.16) to a finite polynomial. If we 
let X equal an integer n (n = 0, 1, 2, 3,...), then we obtain well-behaved solutions 

m 

n — 0, (po — ^ ^ 2 , «i = 0 

n— 1 , <p\ — Mi^e -4 / 2 , Mo — 0 

n — 2, cp 2 = a 0 (\-2£ 2 )e^ 2/2 , a\ = 0 

n — 3, 03 — mi^(1 - = £ 2 )e^ 2/2 , m 0 — 0 

n = 4, 04 — m 0 ( 1 4§ 2 + |^ 4 )e _ ^ 2/2 , Mi = 0 

n — 5, 05 ~ m^( 1 - f£ 2 + ^£ 4 )e“ |2/2 , m 0 = 0 
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Since the parameter A is equal to a positive integer n, the energy E of the harmonic 
oscillator in equation (G.7) is 

E n — (b + ))hoj, n = 0, 1,2,... 

in agreement with equation (4.30). Setting A in equation (G.10) equal to the integer n 
gives 

u" — 2£w' + 2nu — 0 (G.17) 

A comparison of equation (G.17) with (D.10) shows that the solutions «(£) are the 
Hermite polynomials, whose properties are discussed in Appendix D. Thus, the 
functions (/)„()) for the harmonic oscillator are 

<t>n(§) = a„H /2 

where a n are the constants which normalize (/>„())■ Application of equation (D. 14) 
yields the final result 

= (2 n n\y ll2 Jt-^ 4 H n (^ 2 / 2 

which agrees with equation (4.40). 


Orbital angular momentum 

We wish to solve the differential equation 


Lrxp(9, cp ) = Xh 2 ip(0, tp) 


where L 2 is given by equation (5.32) as 

l 2 = -a 21 1 d 


sin 0 d6 


sin 0 


d 


+ 


1 d 2 


dO) sin 2 0dcp 2 


(G.18) 

(G.19) 


We write the function ip(0, cp) as the product of two functions, one depending only on 
the angle 0, the other only on cp 


f(0, tp) = 0(0)<D(<p) (G.20) 

When equations (G.19) and (G.20) are substituted into (G.18), we obtain after a little 
rearrangement 


sin 6? d 


0 AO 


d0 


AO 


— sin 0 —— + A sin' 0 — — 


1 d z O 
<t> Acp 2 


(G.21) 


The left-hand side of equation (G.21) depends only on the variable 0, while the right- 
hand side depends only on tp. Following the same argument used in the solution of 
equation (2.28), each side of equation (G.21) must be equal to a constant, which we 
write as nf. Thus, equation (G.21) separates into two differential equations 


sin# d 

~eTde 



+ A sin 2 0 = m 2 


(G.22) 


and 


d 2 <E> 

Acp 2 


— m 2( A> 


(G.23) 


The solution of equation (G.23) is 

<f> = Ae imv ’ (G.24) 

where A is an arbitrary constant. In order for <1> to be single-valued, we require that 

<f>( cp) — <£>(cp + 2 7t) 


or 



324 


Appendix G 

^2mm _ i 


so that m is an integer, m — 0, ±1, ±2,... 

To solve the differential equation (G.22), we introduce a change of variable 

p — cos 6 (G.25) 

The function 0(0) then becomes a new function F(u) of the variable p, 0(0) — F(p), 
so that 


d0 dFdp . dF 2 pl2& F 

d0 dp dO dp ' dp 

Substitution of equations (G.25) and (G.26) into (G.22) gives 


_d_ 

dp 


dp\ 


+ I 2 — 


1 — p 2 


F = 0 


or 


(1 - p 2 )F" - 2pF' + ( 2 - -— 




F = 0 


(G.26) 


(G.27) 


A power series solution of equation (G.27) yields a recursion formula relating cik+ 4 , 
cik 12 , and (tk, which is too complicated to be practical. Accordingly, we make the 
further definition 


F(p) = (1 - p 2 )^ 2 G(p) (G.28) 

from which it follows that 


F' = (1 — J a 2 ) |m| / 2 [G' - \m\p(l - p 2 Y x G ] (G.29) 

F" = (1 - p 2 ) lml/2 \G" - 2\m\p(l - p 2 y l G' - \m\(\ - p 2 y x G 

+ \m\{\m\ - 2)p\l - p 2 Y 2 G] (G.30) 

Substitution of (G.28), (G.29), and (G.30) into (G.27) with division by (1 — /r)l"'l/ 2 
gives 

(1 — p 2 )G" — 2(\m\ + \)pG' + [2 — H(H + 1)]G = 0 (G.31) 

To solve this differential equation, we substitute equations (G.2), (G.3), and (G.4) 
for G, G', and G" to obtain 

OO OO 

dk(k + s)(k + s — 1 )p k+s ~ 2 + oa[2 — (k + s + \m\)(k + 5 + \m\ + l)]^ i+s 
k =o k =o 


= 0 (G.3 2) 

Equating the coefficient of p s ~ 2 to zero, we obtain the indicial equation 

a () s(s - 1) = 0 (G.33) 

with solutions s — 0 and .v — 1. Equating the coefficient of p s 1 to zero gives 

a\s(s + 1) = 0 (G.34) 

For the case 5 = 0, the coefficient a\ has an arbitrary value, while for ,v = 1, the 
coefficient a\ must vanish. 

If we replace the dummy index k by k + 2 in the first summation on the left-hand 
side of equation (G.32), that equation becomes 
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'y 'f ak+zi.k + s + 2)(k + s + 1) + ctk [A — (k + s + |/w|)(& + s + \tn\ + 1)]}// 


k+s 


= o 


k =0 


(G.35) 

The recursion formula is obtained by setting the coefficient of each power of u equal 
to zero 


(k + 5 + | m\)(k + s + \m\ + 1) - A 
a k +2 = - 77 , ■ , - a k 


(G.36) 


(k + s + 2 )(k + s + 1) 

Thus, we obtain a result analogous to the harmonic oscillator solution. The two 
independent solutions are infinite series, one in odd powers of u and the other in even 
powers of ft. The case ,v — 0 gives both solutions, while the case 5=1 merely 
reproduces the odd series. These solutions are 

/ \m\(\m\+\)-X 2 [(M +2)(\m\ +3) - A][|/m|(|/«| + 1) - A] 4 

°i = «o1 1 H-77,- 1 1 d-77- , u 


2 ! 


4! 


+ ...} 


(G.37a) 


_ / (\m\ + \)(\m\ + 2)-X 3 

vt2 — a 15 f.i H- ^ 

KM + 3)(|m| + 4) - A][(MI + 1)(M + 2) - A] 5 , 
+ 5! f 


(G.37b) 


The ratio of consecutive terms in G\ and in G 2 is given by equation (G.36) as 


a k +2ft 


k+2 


{k+ \m\)(k+ \m\ + 1) - A 2 


In the limit as k 


atfi k (k+l)(k + 2) 

00 , this ratio becomes 
. k+2 


[X 


lim 

k^> 00 


Cl k+2 ^ 


k 1 — X 


-fi —* [A, 


akfi k k 2 

As long as \fi\ < 1, this ratio is less than unity and the series G\ and G 2 converge. 
However, for u — 1 and ft — — 1, this ratio equals unity and neither of the infinite 
power series converges. For the solutions to equation (G.31) to be well-behaved, we 
must terminate the series Gi and G 2 to polynomials by setting 

A = (k + \m\)(k + \m\ + 1) = 1(1 + 1) (G.38) 

where / is an integer defined as / = \m\ + k, so that l — \m\, \ m\ + 1, \ m\ + 2, ... We 
observe that m f S o that m takes on the values —/,—/+ 1,..., -1,0, 1,..., 

I-1,1. 

Substitution of equation (G.38) into the differential equation (G.27) gives 

( \ 

(\-fi 2 )F"-2fiF’+ 1(1+ 1)-- -, 

V 1 ~/ i y 

which is identical to the associated Legendre differential equation (E.17). Thus, the 
well-behaved solutions to (G.27) are proportional to the associated Legendre poly¬ 
nomials P\ m \(ff) introduced in Appendix E 

F(+) = 

Since we have 0(0) = Ffu), where u — cos 0, the functions 0(0) are 

„(< 

and the eigenfunctions if (6, cp) of L 2 are 


(G.39) 


F(+ = cP l , m] (ju) 


0/m(0) — CP I / '" I (COS0) 
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VUO, cp) = c lm P ] ;\cos 0) e im<p (G.40) 

where c/ m are the normalization constants. A comparison of equation (GAO) with 
(5.59) shows that the functions cp) are the spherical harmonics Yi m (6, cp). 


Radial equation for the hydrogen-like atom 

The radial differential equation for the hydrogen-like atom is given by equation (6.24) 
as 


&S 

dp 2 


2dS 

pdp + 


1 A 1(1+1) 
4 p p 2 


5 = 0 


(G.41) 


where / is a positive integer. If a power series solution is applied directly to equation 
(G.41), the resulting recursion relation involves ak+i, a k\\, and ak- Since such a three- 
term recursion relation is difficult to handle, we first examine the asymptotic behavior 
of S(p). For large values of p, the terms in p 1 and p 2 become negligible and 
equation (G.41) reduces to 

dj5 _ 5 
dp 2 4 


or 

5 = ce ±p/2 

where c is the integration constant. Since p, as defined in equation (6.22), is always 
real and positive for £ 0, the function c /l/2 is not well-behaved, but e G 2 is. 

Therefore, we let 5(p) take the form 

S(P) = F(p)c p! 1 (G.42) 

Substitution of equation (G.42) into (G.41) yields 

p 2 F" + p(2 - p)F' + [(A - l)p - 1(1 + 1)]F = 0 (G.43) 

where we have multiplied through by p 2 c ,,/2 . To solve this differential equation by the 
series solution method, we substitute equations (G.2), (G.3), and (G.4) for F, F', and 
F" to obtain 


OO OO 

Y a k [(k + s)(k + s+D-1(1 + l)]p i+s + Y a*(A ~ 1 - k - s)p k+s+1 = 0 
k =0 k =0 

(G.44) 

The indicial equation is given by the coefficient of p v as 

a 0 [s(s + 1) - /(/ + 1)] = 0 (G.45) 

with solutions s — 1 and 5 = —(l + 1). For the case s — —(1 + 1), we have 

F(p) =: <3 0 p _(/+1) + Clip-' + a 2 p~ l+1 4 - (G.46) 

which diverges at the origin. 4 Thus, the case s — l is the only acceptable solution. 

Omitting the vanishing first term in the first summation on the left-hand side of 
(G.44) and replacing k by k + 1 in that summation, we have 


4 The reason for rejecting the solution s = —(/+ 1) is actually more complicated for states with 1 = 0. 
I. N. Levine (1991) Quantum Chemistry, 4th edition (Prentice-Hall, Englewood Cliffs, NJ), p. 124, 
summarizes the arguments with references to more detailed discussions. The complications here strengthen 
the reasons for preferring the ladder operator technique used in the main text. 
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OO 

5^{a* +1 [(* + \)(k + 21 + 2)] + a k (A - l-l - k)}p k+s+1 = 0 (G.47) 

k=0 

Since the coefficient of each power of p must vanish, we have for the recursion 
formula 


k+l+l-A 

° k+1 ~ (k+ l)(k + 2l + 2) Uk 
Thus, we obtain the following set of expansion constants 
a 0 

1+ 1 -A 
ai ^^iTT ao 

_l+2-A _(l + 2- A)(l +1 - A) 

° 2 ~~ 2(2/ + 3) Ql 2(21 H- 3X2/ + 2) ^ 

_/ + 3 -A _ (/ + 3 - A)(l + 2- A)(l + 1 - A) 

° 3 “ 3(2/+ 4) a2 ~ 3!(2/ + 4)(2/ + 3)(2/ + 2) a ° 


(G.48) 


so that the solution of (G.43) is 


F — a 0 p‘ 


(, , y' (I + k - A)(l + k - 1 - A) ■ ■ ■ (l + 1 - A) A 

^ ff k\(2l + k + 1 )(2 / + k) ■ ■ ■ (21 + 2 ) P J 


(G.49) 


We have already discarded the second solution, equation (G.46). 

The ratio of consecutive terms in the power series expansion F is given by equation 
(G.48) as 

cik+ip k+l+x k + l + 1 — A 
a k p k+l ~ (k + l)(k + 21 + 2) P 

In the limit as k —» oo, this ratio becomes p/k, which approaches zero for finite p. 
Thus, the series converges for all finite values of p. To test the behavior of the power 
series as p —» oo, we consider the Taylor series expansion of d’ 


e r = y f - 

^ Id 


k= o 


and note that the ratio of consecutive terms is also p/k. Since the behavior of Fas 
p —> oo is determined by the expansion terms with large values of k (k oo), we see 
that F behaves like e p as p —> oo. This behavior is not acceptable because Sip) in 
equation (G.42) would take the form 

S(p) —> p l e p e~ p l 2 — p l e p ^ 2 —> oo as p —> oo 

and could not be normalized. 

The only way to avoid this convergence problem is to terminate the infinite series 
(equation (G.49)) after a finite number of terms. If we let A take on the successive 
values l + 1, l + 2,, then we obtain a series of acceptable solutions of the 
differential equation (G.43) 
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/+ 1 , Fq — aop 1 


1 + 2 , f.W(< l-27^2*>) 

' + 3 ' fi = ‘ ,W '( 1 ~'TT P + (2/ + 3)(2/+2) '’ 2 ) 


Since l is an integer with values 0, 1, 2,, the parameter A takes on integer values n, 
n = 1, 2, 3, ..., so that n — l + 1, / + 2, ... When the quantum number n equals 1, 
the value of / is 1; when n — 2, we have / = 0, 1; when n = 3, we have 1=0, 1,2; 
etc. 

The energy E of the hydrogen-like atom is related to A by equation (6.21). If we 
solve this equation for E and set A equal to n, we obtain 

pZ 2 Q ,A 


E„ = - 


2h 2 n 2 


n = 1, 2, 3, ... 


in agreement with equation (6.48). 

To identify the polynomial solutions for F(p), we make the substitution 

F(p) = p'u(p) (G.50) 

in the differential equation (G.43) and set A equal to n to obtain 

pu" + [2(1 + 1) - p]u' + (n — l — 1 )u m 0 (G.51) 


Since n and / are integers, equation (G.51) is identical to the associated Laguerre 
differential equation (F.16) with k — n + / and j = 21+ 1. Thus, the solutions u(p) 
are proportional to the associated Laguerre polynomials L 2 J+j(p), whose properties are 
discussed in Appendix F 

u(p) = cL 2 !+ l (p) (G.52) 

Combining equations (G.42), (G.50), and (G.52), we obtain 

S*(p)**niP l e- p/2 L 2 £l(p) (G.53) 

where c„; are the normalizing constants. Equation (G.53) agrees with equation (6.53). 
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Recurrence relation for hydrogen-atom expectation 

values 


The expectation values (r k ) „/ of various powers of the radial variable r for a hydrogen¬ 
like atom with quantum numbers n and / are given by equation (6.69) 


{r k )n, = 


r K [R n i(r)?r 2 dr 


(H.l) 


where R„i( r) are the solutions of the radial differential equation (6.17). In this 
appendix, we show that these expectation values are related by the recurrence relation 

1 - k 2 




nl 


+ k 


i(i + 1 )+■ 


f 2 (r k - 2 )m = 0 (H.2) 


To simplify the notation, we define the real function u(r) by u = rR n /( r) and denote 
the first and second derivatives of u( r) by u' and it". Equation (H.l) then takes the 
form 


Since we have 



r k u 2 dr 


o 


(H.3) 


d/^(r) u' 
dr r 

equation (6.17) becomes 


u 


rr 


u_ d f 2 d R(r) \ 

r 2 ’ dr \ dr J 


1(1+ 1) 2Z 
r 2 ciq r 


n 2 Oa 


(H.4) 


where equation (6.57) for the energy E„ has also been introduced. 

Before beginning the direct derivation of equation (H.2), we first derive a useful 
relationship. Consider the integral 


and integrate by parts 


•OO 

r v uu' dr 
o 


v t j v 2 

r uu dr = r u 


u — (r v u) dr 
o dr 


The integrated part vanishes because R(r) —>■ 0 exponentially as r —> oo and u( r) 0 
as r —+ 0. Expanding the derivative within the integral on the right-hand side, we have 
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*oo 

r v uu' dr = — v 

*oo 

r v ~ l u 2 dr - 

Jo 

0 


r v uu' dr 


Combining the integral on the left-hand side with the last one on the right-hand side, 
we obtain the desired result 


r v ini' d r — - (r 


V— 1\ 


-° 2 

To obtain the recurrence relation (H.2), we multiply equation (H.4) by r k+l u' and 
integrate over r 

2 coo 


(H.5) 


r k+1 u'u"dr= 1(1+ 1) 


POO rj ry pOO 

r k -1 ■ - /Z 


l uu' dr- 

fl 0 


r k uu' dr + - 


2„2 


n a 


r k+1 uu' dr 


l(l+l)(k-l) kZ . h (k+l )Z 2 , 

-2- ( " - <H - 6) 

where equation (H.5) was applied to the right-hand side. The integral on the left-hand 
side of (H.6) may be integrated by parts twice to give 

d 

u' — (r k+1 u')dr 
Jo dr 


' k+1 u'u"dr- 


— ~(k + 1 ) 

= (.k + 1 ) 


coo 

r k u'u' dr — 
o 

r°° d 


u-(r u') dr - 
o dr 


* k+1 u'u"dr 


- k+1 u'u"dr 


t= k(k+ 1) 


r k l uu' dr + (k + 1) 


r k uu" dr — 


^ +1 w'w"dr 


The integral on the left-hand side and the last integral on the right-hand side may be 
combined to give 


-* +1 w'«"dr = - 


{k -\)k(k+\) k _ 2 (k+ 1) 

— V /«/ + ■ 


r k uu" dr 


(H.7) 


Jo 4 ' 2 

where equation (H.5) has been used for the first integral on the right-hand side. 
Substitution of equation (H.4) for u" in the last integral on the right-hand side of (H.7) 
yields 


rk+i u'n"dr = - W +1 > {r k -2 )n/ + (k+DKl+l) {rk . 2] 


(k+l)Z t _i (k+l)Z 2 M 

\> ) nl + 0 t _2 V ) n l 


nl 


a o 


2 n 2 a^ 


(H.8) 


Combining equations (H.6) and (H.8), we obtain the recurrence relation (H.2). 
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Matrices 


An m X n matrix A is an ordered set of mn elements ay (i — 1,2,, m; j — 1, 2, 

..., n ) arranged in a rectangular array of m rows and n columns, 

0n On • • ■ a i„ 

021 022 ' ' ' 02/1 

O/nl O m 2 ' ' ' O mn 

If /« equals «, the array is a square matrix of order n. If we have m — 1, then the 
matrix has only one row and is known as a row matrix. On the other hand, if we have 
n — 1 , then the matrix consists of one column and is called a column matrix. 




Matrix algebra 

Two m X n matrices A and B are equal if and only if their corresponding elements are 
equal, i.e., ay — by for all values of i and j. Some of the rules of matrix algebra are 
defined by the following relations 

A + B = B + A = C; cy = ay + by 

A - B = C; Cy — ay - by (1.2) 


/v A = C; Cy — kay 

where k is a constant. Clearly, the matrices A, B, and C in equations (1.2) must have 
the same dimensions mX n. 

Multiplication of an m X n matrix A and an n X p matrix B is defined by 

n 

AB =3 C; Cik :my^2 a 9 b jk (1-3) 

7=1 

The matrix C has dimensions m X p. Two matrices may be multiplied only if they are 
conformable, i.e., only if the number of columns of the first equals the number of rows 
of the second. As an example, suppose A and B are 


/ an 

«12 

013 ^ 


(b n 

bu \ 

1 fl21 

022 

023 

|; B= | 

, &21 

bn J 

\«31 

032 

033 / 


\ bn 

bn J 


Then the product AB is 
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/ a \\b{\ + «12^21 + «13^31 a \\b\2 + Clubu + 013^32 \ 

AB = I 021^11+^22^21+^23^31 <?21 ^12 + <^22^22 + «23^32 ) 

\ <^31 ^11 + <332^21 + <333Z?31 <331 Z>12 + <732^22 + <733^32 / 

Continued products, such as ABC, may be defined and evaluated if the matrices are 
conformable. In such cases, multiplication is associative, for example 

= £E UijbjkCki 


ABC 


da 


A(BC) = (AB)C — D; 

For the null matrix 0, all the matrix elements are zero 


(1.4) 


( 0 

0 •• 

' ° \ 

0 

0 •• 

• 0 

V 0 

0 •• 

• 0/ 

0 +A 

= A + 0 

= A 


(1.5) 


and we have 

( 1 . 6 ) 

The product of an arbitrary m X n matrix A with a conformable n X p null matrix is 
the m X p null matrix 

AO = 0 (1.7) 

In matrix algebra it is possible for the product of two conformable matrices, neither of 
which is a null matrix, to be a null matrix. For example, if A and B are 


1 

1 

°\ 


( 1 


2 

-2 

2 ; 

B — ( 

-1 

1 

3 

3 

1 / 

\ 

V 0 

0 / 


then the product AB is the 3X2 null matrix. 

The transpose matrix A T of a matrix A is obtained by interchanging the rows and 
columns of A. If the matrix A is given by equation (1.1), then its transpose is 


/ a n 
a\2 


<721 

<722 


O n> 1 \ 

Cl m2 


( 1 . 8 ) 


J 


\ Cl \n &2n 

Thus, the elements a]- of A T are given by aT. = a /r 
Let the matrix C be the product of matrices A and B as in equation (1.3). The 
elements cj k of the transpose of C are then given by 

c T - = V a kj bji V albl - bla T 


3 = 1 


3= 1 


3 = 1 


ij u jk 


(1.9) 


where we have noted that <7+j = ap a and b J a p — bp a . Thus, we see that 

c T = b t a t 


or 

(AB) t = B t A t (1.10) 

This result may be generalized to give 

(AB...Q) t = Q t ..-B t A t (1.11) 

as long as the matrices are conformable. 

If each element aj in a matrix A is replaced by its complex conjugate a*, then the 
resulting matrix A* is called the conjugate of A. The transposed conjugate of A is 
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called the adjoint 1 of A and is denoted by A . The elements a\. of A are obviously 
given by a\ = a*. 


Square matrices 

Square matrices are of particular interest because they apply to many physical 
situations. 

A square matrix of order n is symmetric if a j — aji, ( i, j— 1,2,..., n), so that 
A = A T , and is antisymmetric if - -fly,-, ( i,j= 1,2,, ri), so that A = A T . 

The diagonal elements of an antisymmetric matrix must all be zero. Any arbitrary 
square matrix A may be written as the sum of a symmetric matrix A ( s) and an 
antisymmetric matrix A (al 

A = A (S) + A (a) (1.12) 

where 

a f — l(°ij + a jd’ a f ~ \( a v - aji) (1.13) 

A square matrix A is diagonal if fly m 0 for i / j. Thus, a diagonal matrix has the 
form 



/ ai 

0 

• 0 \ 

A = 

0 

fl 2 • ■ 

• 0 


V o 

0 

• a„ J 


A diagonal matrix is scalar if all the diagonal elements are equal, fli » a 2 p 
a„ = a, so that 


/a 0 

••• 0 \ 

0 a 

... 0 

\ 0 0 

a J 

is the unit matrix 1 , for v 

( i o 

••• 0 \ 

0 1 

... 0 

V 0 0 

... ! ) 


(1.14) 


(1.15) 


(1.16) 


The elements of the unit matrix are dy, the Kronecker delta function. 

For square matrices in general, the product AB is not equal to the product BA. For 
example, if 


A = 




0 

3 


then we have 


AB =(" »)■ BA =(“ o)^ AB 

If the product AB equals the product BA, then A and B commute. Any square matrix 
A commutes with the unit matrix of the same order 


1 Mathematics texts use the term transpose conjugate for this matrix and apply the term adjoint to the 
adjugate matrix defined in equation (1.28). 
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Al = IA 

Moreover, two diagonal matrices of the same order commute 

/ a\ 0 ••• 0 \ / b\ 0 ••• 0\ / a\b\ 0 

AD 0 a 2 ■■■ 0||0 b 2 ■■■ 0 I 0 ci 2 b 2 


0 0 


0 0 ■ ■ ■ b„ 


0 0 


( 1 . 18 ) 


Determinants 

For a square matrix A, there exists a number called the determinant of the matrix. This 
determinant is denoted by 


on 

a 12 

^1 n 


\A\ = 

022 

&2n 

(1.19) 

^ n\ 

o n'2 

ftnn 


and is defined as the summation 




Ml = ^dpOl,0 2 y • 

&nq 

(1.20) 


p 


where d P — ± 1 . The summation is taken over all possible permutations i, j, , q of 
the sequence 1, 2, ..., n. The value of 6 P is +1 (— 1) if the order i,j,..., q is 
obtained by an even (odd) number of pair interchanges from the order 1 , 2 ,..., n. 
There are n\ terms in the summation, half with () P — 1 and half with d P p — 1. Thus, 
for a second-order determinant, we have 

\A\ = 11 12 — a\\a 22 — a\ 2 a 2 \ (1.21) 

a 2 \ a 22 

and for a third-order determinant, we have 
nil ni2 

\A\ — a 2 \ a 22 a 22t 

031 a 22 «33 

= 011O22O33 + a\ 2 a 22 a 2 \ + a\ 2 a 2 \a 22 — (nnn23n32 + ni2fl2in33 + a\ 2 a 22 a 2 i) (1.22) 

If the determinant | of the matrix A vanishes, then the matrix A is said to be 
singular. Otherwise, the matrix A is non-singular. 

The determinant A has the following properties, which are easily derived from the 
definition (1.20). 

1. The interchange of any two rows or any two columns changes the sign of the 
determinant. 

2. Multiplication of all the elements in any row or in any column by a constant k gives 
a new determinant of value k\A\. (Note that if B = kA, then \B\ — k n \A\.) 

3. The value of the determinant is zero if any two rows or any two columns are 
identical, or if each element in any row or in any column is zero. As a special case 
of properties 2 and 3, a determinant vanishes if any two rows or any two columns 
are proportional. 
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4. The value of a determinant is unchanged if the rows are written as columns. Thus, 
the determinants of a matrix A and its transpose matrix A T are equal. 

5. The value of a determinant is unchanged if, to each element of one row (column) is 
added a constant k times the corresponding element of another row (column). Thus, 
we have, for example 


an 

<212 

<213 


<2 it + kan 

<?12 

<213 


a ii -f kciii 

<2 12 + kan 

a 13 T - ka^i 

U21 

<222 

<223 

= 

<221 + ka22 

<222 

<223 

= 

«21 

<222 

<223 

U31 

<232 

<233 


U 3 1 + fe?32 

<?32 

<233 


«31 

<232 

<233 


Each element ay of the determinant \A\ in equation (1.19) has a cofactor Cy, which 
is an (n — l)-order determinant. This cofactor Cy is constructed by deleting the /th 
row and the /th column of | A and then multiplying by (— 1 ) l+ f For example, the 
cofactor of the element an in equation (1.22) is 

n «21 <223 

<-12 — — — <223<<31 — <221«33 

«31 (733 

The summation on the right-hand side of equation (1.20) may be expressed in terms 
of the cofactors of the first row of \A\, so that (1.20) becomes 

n 

Ml = ct\\C\\ + a 12 C \2 + • • • + a\ n C\ n = flu-Cu (1-23) 

£=1 

Alternatively, the expression of Ml in equation (1.20) may be expanded in terms of any 
row i 

n 

Ml — y . a ikCik, i = 1, 2, ..., n (1.24) 

k= 1 

or in terms of any column j 

n 

Ml akjCkj, j = 1, 2, ..., n (1.25) 

k= 1 

Equations (1.20), (1.24), and (1.25) are identical; they are just expressed in different 
notations. 

Now suppose that row 1 and row i of the determinant Ml are identical. Equation 
(1.23) then becomes 

n 

Ml = tf/i Cn + ciqC n + • • • + a in C\ n — ajkCik — 0 

k= t 

where the determinant Ml vanishes according to property 3. This argument applies to 
any identical pair of rows or any identical pair of columns, so that equations (1.24) and 
(1.25) may be generalized 

n n 

^ ^ ktikCjk ^ ' QkiCkj — M 1 9 U 1 h j — 1? 2, . . . , O (1.26) 

k= 1 k= 1 

It can be shown 2 that the determinant of the product of two square matrices of the 
same order is equal to the product of the two determinants, i.e., if C = AB, then 

|C| = Ml • Ml (1-27) 

2 See G. D. Arfken and H. J. Weber (1995) Mathematical Methods for Physicists, 4th edition (Academic 
Press, San Diego), p. 169. 
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It follows from equation (1.27) that the product of two non-singular matrices is also 
non-singular. 

Special square matrices 

The adjugate matrix A of the square matrix A is defined as 


c 11 

C21 

Cm > 

c 12 

C22 

Cnl (1.28) 

Ci„ 

C 211 ■ 

Cyin ) 


where Cy are the cofactors of the elements ay of the determinant \ A\ of A. Note that 
the element ay of A is the cofactor C/*-. The matrix product AA is a matrix B whose 
elements by are given by 

n n 

by — ^ ' 0 ( 1 ; Cl jy — ^ ' ClikCjk — 41 r) y (1.29) 

k= 1 k= 1 

where equation (1.26) was introduced. Thus, we have 

AA = B = \A\l — AA (1.30) 

where I is the unit matrix in equation (1.16), and we see that the matrices A and A 
commute. 

Any non-singular square matrix A possesses an inverse matrix A 1 defined as 

kr l =k/\A\ (1.31) 

From equation (1.30) we observe that 

AA 1 = A A 4 I (1.32) 

Consider three square matrices A, B, C such that AB = C. Then we have 

A _1 AB = A _1 C 

or 

B = A 'C (1.33) 

Thus, the inverse matrix plays the role of division in matrix algebra. Multiplication of 
equation (1.33) from the left by B 1 and from the right by C 1 yields 

C 1 = B ‘A 1 

or 

(AB) 1 ■= B 'A 1 (1.34) 

This result may easily be generalized to show that 

(AB Q) 1 = Q 1 B 1 A 1 (1.35) 

A square matrix A is hermitian or self-adjoint if it is equal to its adjoint, i.e., if 
A = A or ay — a*. Thus, the diagonal elements of a hermitian matrix are real. 

A square matrix A is orthogonal if it satisfies the relation 

AA T = A T A = I 

If we multiply AA T = I from the left by A -1 , then we have the equivalent definition 

A T = A 1 

Since the determinants \A\ and \A T \ are equal, we have from equation (1.27) 

\A\ 2 = 1 or \A\ — ±1 
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The product of two orthogonal matrices is an orthogonal matrix as shown by the 
following sequence 

(AB) t = B t A t = B 'A 1 = (AB) 1 

where equations (1.10) and (1.34) were used. The inverse of an orthogonal matrix is 
also an orthogonal matrix as shown by taking the transpose of A -1 and noting that the 
order of transposition and inversion may be reversed 

(A -1 ) 1 = (A T r' = (A - 1 )- 1 

A square matrix A is unitary if its inverse is equal to its adjoint, i.e., if A -1 = A or 
if AA - A A - I. For a real matrix, with all elements real so that ay — a*, there is 
no distinction between an orthogonal and a unitary matrix. In that case, we have 
A f = A T = A '. 


Linear vector space 

A vector x in three-dimensional cartesian space may be represented as a column 
matrix 


x = 



(1.36) 


where x \, xy, *3 are the components of x. The adjoint of the column matrix X is a row 
matrix 


X 1 = (x* x* x*) (1.37) 

The scalar product of the vectors x and y when expressed in matrix notation is 

x y =». (** X* X *) ^ y 2 ^ = x*y\ + X*yi + **j 2 (1-38) 

Consequently, the magnitude of the vector x is 

(x+x) 1 / 2 = (|xi| 2 + |x 2 | 2 + N 2 ) 1/2 (1.39) 

If the vectors x and y are orthogonal, then we have x y = 0. The unit vectors i, j, k in 
matrix notation are 





(1.40) 


A linear operator A in three-dimensional cartesian space may be represented as a 
3X3 matrix A with elements ay. The expression y — Ax in matrix notation becomes 


y = 



( a ii 
021 
031 


«12 

013 \ 

/ Xi 

022 

023 

*2 

032 

O33 ) 

\*3 


( 011*1 + Ol2*2 + 013*3 
021*1 + 022*2 + 023*3 
H" ^32*^2 + ^33-^3 


(1.41) 

If A is non-singular, then in matrix notation the vector X is related to the vector y by 

x = A ‘y (1.42) 

The vector concept may be extended to n-dimensional cartesian space, where we 
have n mutually orthogonal axes. Each vector x then has n components (x \, x 2 , • - •, 
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x n ) and may be represented as a column matrix X with n rows. The scalar product of 
the ^-dimensional vectors x and y in matrix notation is 

x y = x* y, + x*y 2 4-b x*y„ (1.43) 

and the magnitude of x is 

(x f x ) 1/2 = (>! | 2 + | x 2 | 2 + ••• + | x „| 2 ) 1/2 (1.44) 

If we have x^y = 0, then the vectors x and y are orthogonal. The unit vectors \ a 
(a — 1,2,, n) when expressed in matrix notation are 





(1.45) 


If a vector y is related to a vector X by the relation y = Ax and if the magnitude of 
y is to remain the same as the magnitude of X, then we have 

xx = y t y = (Ax) t Ax = x t A t Ax (1.46) 

where equation (1.10) was used. It follows from equation (1.46) that A A = I so that A 
must be unitary. 


Eigenvalues 

The eigenvalues A of a square matrix A with elements ay are defined by the equation 

Ax = lx — Alx (1.47) 

where the eigenvector X is the column matrix corresponding to an n-dimensional 
vector and A is a scalar quantity. Equation (1.47) may also be written as 

(A-Al)x = 0 (1.48) 

If the matrix (A - Al) were to possess an inverse, we could multiply both sides of 
equation (1.48) by (A — AI) 1 and obtain X = 0. Since X is not a null matrix, the matrix 
(A — Al) is singular and its determinant vanishes 

an — A a l2 ' ' ' a \n 

®2\ U22 ~ X ••• «2n 

atii a n 2 ''' a jui A 

The expansion of this determinant is a polynomial of degree n in A, giving the 
characteristic or secular equation 

X" T c n —iX n ' -j- • • • T ciA -j- Co — 0 (1.50) 

where c, (z = 0, 1,..., n — 1) are constants. Equation (1.50) has n roots or eigenvalues 
A a (a = 1,2,..., n). It is possible that some of these eigenvalues are degenerate. 

The eigenvalues of a hermitian matrix are real. To prove this statement, we take the 
adjoint of each side of equation (1.47), apply equation (1.10), and note that A — A 

(Ax) f = xA = x f A = A*x f (1.51) 

Multiplying equation (1.47) from the left by X and equation (1.51) from the right by X, 
we have 

x f Ax = Ax f x 
xAx = A*xx 

Since the magnitude of the vector X is not zero, we see that X — X* and A is real. 
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The eigenvectors of a hermitian matrix with different eigenvalues are orthogonal. To 


prove this statement, we consider two distinct eigenvalues X\ and A 2 and their 
corresponding eigenvectors x (1) and X l2) , so that 

Ax (1) = 2iX (1) (1.52a) 

Ax ( 2) «A 2 x (2) (1.52b) 

If we multiply equation (1.52a) from the left by X (2) i and the adjoint of (1.52b) from the 
right by x ( 1 we obtain 

x ,2,t Ax (1) = A 1 x (2)t x (1) (1.53a) 

(Ax (2) ) t x (1) = x (2)l A^ 11 = x (2)l Ax (1) = A 2 x (2)t x (1) (1.53b) 


where we have used equation (1.10) and noted that A 2 is real. Subtracting equation 
(1.53b) from (I.53a), we find 

(Aj - A 2 )x (2)t x (1) «= 0 

Since X\ is not equal to X 2 , we see that x (1) and X <2) are orthogonal. 

The eigenvector x (a) corresponding to the eigenvalue X a may be determined by 
substituting the value for X a into equation (1.47) and then solving the resulting 
simultaneous equations for the components x ( f\ x ( 3 a> , ..., x ( “> of X 1 " 1 in terms of the 
first component x 3 \ The value of the first component is arbitrary, but it may be 
specified by requiring that the vector x (a> be normalized , i.e., 

x (a)t x (a) = |x ( ! a) | 2 + |xf | 2 + • • • + |x ( „ a) | 2 = 1 (1.54) 

The determination of the eigenvectors for degenerate eigenvalues is somewhat more 
complicated and is not discussed here. 

We may construct an n X n matrix X using the n orthogonal eigenvectors x (a> as 
columns 



(1.55) 


(1.56) 


Equation (1.47) may then be written in the form 

AX = XA (1.57) 

The matrix X is easily seen to be unitary. Since the n eigenvectors are linearly 
independent, the matrix X is non-singular and its inverse X 1 exists. If we multiply 
equation (1.57) from the left by X we obtain 

X 1 AX = A (1.58) 

This transformation of the matrix A to a diagonal matrix is an example of a similarity 
transform. 
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Trace 

The trace Tr A of a square matrix A is defined as the sum of the diagonal elements 

Tl'A — flu + £?22 + • • • + Aral (1-59) 

The operator Tr is a linear operator because 

Tr(A + B) = (an + b\\) + (#22 + bii) + • • • + (a nn + b nn ) = Tr A + Tr B (1.60) 
and 

Tr(cA) = ca\\ + cfl 22 + • • • + ca„„ — cTr A (1.61) 

The trace of a product of two matrices, which may or may not commute, is 
independent of the order of multiplication 

n n n n 

Tr(AB) ]T;r aybji = £ £ b n a i} = Tr(BA) (1.62) 

i=l 7 = 1 y=l i=l 

Thus, the trace of the commutator [A, B] = AB - BA is equal to zero. Furthermore, 
the trace of a continued product of matrices is invariant under a cyclic permutation of 
the matrices 

Tr(ABC • • • Q) = Tr(BC QA) = Tr(C • • • QAB) *• • • (1.63) 

For a hermitian matrix, the trace is the sum of its eigenvalues 

n 

TrA = la (1.64) 

a= 1 

To demonstrate the validity of equation (1.64), we first take the trace of (1.58) to obtain 

n 

Tr(X “ 1 AX) = Tr A = ^ X a (1.65) 

a= 1 

We then note that 

Tr(X *AX) = ^(X^AXW = £ '£'L X a! a u X .^ 

a= 1 a= 1 i=l 7=1 

n n n n n n 

— fly Xj a X ai — aydy — an = TrA (1.66) 

i=l j= 1 a=l i=l 2=1 ;=1 

where (= x'' 1 ) are the elements of X. Combining equations (1.65) and (1.66), we 
obtain equation (1.64). 
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Evaluation of the two-electron interaction integral 


In the application of quantum mechanics to the helium atom, the following integral / 
arises and needs to be evaluated 


I = 


g-(Pl+P2) 

- dpidp2 

P 12 


f e -(pi+P2) 

= ••• - p\p\ sin 6 X sin 0 2 dpi d0i d( Pl dp 2 dd 2 dcp 2 (J.l) 

P 12 

where the position vectors p, (/ = 1 , 2 ) have components p,, 0,, (pj in spherical polar 
coordinates and where 


Pl2 — |P2 — Pi | 

The distance pi 2 is related to p \ and p 2 by the law of cosines 

Pn « p\ + P 2 2 ~ 2p i p 2 cos y (3.2) 

where y is the angle between pi and P 2 as shown in Figure J.l. The integration is 
taken over all space for each position vector. 

The integral / may be evaluated more easily if we orient the coordinate axes so that 
the vector pi lies along the positive z-axis as shown in Figure J.2. In that case, the 
angle y between pi and p 2 is equal to the angle 0 2 . If we define p> as the larger and 
p< as the smaller of p\ and p 2 and define 5 by the ratio 

s = — 

~ P> 

so that s =£ 1 , then equation (J. 2 ) may be expressed in the form 

— = —(1 + s 2 - 2s cos 9 2 y 1 ^ 2 (J.3) 

Pl2 P> 

At this point, we may proceed in one of two ways, which are mathematically 
equivalent. In the first procedure, we note that from the generating function (E.l) for 
Legendre polynomials P h equation (J.3) may be written as 

1 j 00 

— = — Pi (cos 0 2 )s l 
Pi 2 P>j~ J o 

The integral / in equation (J. 1) then becomes 
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z 



Figure J.l Distance between two particles 1 and 2 and their respective distances from 
the origin. 


z 



Figure J.2 Rotation of the coordinate axes in Figure J.l so that the z-axis lies along p\. 


'=E 


1=0 


Ap\+pA 


P> 


-s 1 p\p\ dp \ dp 2 


Pi (cos 62 ) sin 0 2 d0 2 


sin 9\ d$i 


•2n 


drpi 


•In 


dcp 2 


The integrals over d\,q> 1 , and <pi are readily evaluated. Since Pit(u) — I, we may write 
the integral over 0 2 as 
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/’/(cos 62 ) sin 0 2 d 0 2 


Pi(ju)P 0 (ju) d/< = 2d /i 


where equations (E.18) and (E.19) have been introduced. Thus, only the term with 
/ = 0 in the summation does not vanish and we have 


/ = 16 ji 2 


-ipi+pi) 

P> 


p\p\ dpi dp2 


(J-4) 


In the second procedure, we substitute equation (J.3) directly into (J.l) and evaluate 
the integral over 0 2 

I Jt 

„2 1/2 


n _L 2 S1 f 2 —TTm d ° 2 = “C 1 + 52 “ 2 s cos 6 2 y 
q {\ + s 1 — 2 s cos 0 2 yI 1 s 


0 


1 


= - [(1 + + 2 s) 1/2 - (1 + s z - 2 sY /z ] 

s 

= '[(1 -t ,). (1 -.V)j^2 

The integrals over Q \, <p \, and cp 2 are the same as before and equation (J.4) is obtained. 

Since p> is the larger of p\ and p 2 , the integral / in equation (J.4) may be written in 
the form 


d/2 n 


dpi 



•OO 

’ 1 

‘pl 

*00 

/ = 162T 2 

e- pi p 2 

z Pl P 2 dp 2 + 

e~ P2 p2 dp 2 

• 

0 

Pl. 

0 

Pl 


= 16 ji 2 


-pi, 


pi{[2-(p 1 + 2p 1 + 2)e Pl ]+ pi(pi + l)e Pl }dpi = 16jr(f + §) 


Accordingly, the final result is 


1 = 


-(P1+P2) 


P 12 


dp! dp 2 = 20ji 2 


(J.5) 
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Bom-Oppenheimer approximation 264-6, 
268-9, 273 

diatomic molecule 269-80 
nuclear wave function 265-8, 273-4, 276 
moment of inertia 149,278 
momentum operator 43-4, 58, 70, 86 
momentum space wave function, see wave function 
- in momentum space 
Morse potential 279-80 

normalized functions 38-41, 47, 51, 58, 69, 86, 98 
nuclear motion, see molecular structure 

operator 

adjoint 82-3, 163 
exchange 210-11,213-16,230 
for generalized angular momentum 132-4, 155 
hermitian 69-75, 77-87, 104 
linear 65-8, 85, 104 
for momentum 43-4, 58, 70, 86 
for orbital angular momentum 131 -2, 139, 
149-50, 155, 160-1,202-6 
parity 94-6, 253 
permutation 212-16,219-21 
projection 83-4 

self-adjoint, see operator - hermitian 
for spin angular momentum 196-7, 199, 202-6 
see also Hamiltonian operator, ladder operators 
Oppenheimer, J. R. 265 

orbital angular momentum 130-2, 138-46, 155 
classical 130-1 
eigenvalues 138, 140 
ladder operators 140-1 
and magnetic moment 151-5, 190, 201 
operators 131-2, 139, 149-50, 155, 160-1, 
202-6 

operators in spherical polar coordinates 138-40 
spin-orbit coupling 201-6,262 
wave functions 138-47,323-6 
orbitals, atomic, see atomic orbitals 
orthogonal functions 51,69,71-4,80,211,215, 
217 

expansions in terms of 75-7, 84, 88-91, 94, 198 
orthonormal functions, defined 51 -2, 69 
oscillator, see harmonic oscillator 
overlap integral 237 

parity 95, 192 

operator 94-6, 253 
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Parseval’s theorem 10, 18, 35, 41, 288-9, 291 
particle in a box 48-52, 64, 91, 104-5, 230 
energy levels 50, 62-3 
perturbed 261-2 
three-dimensional 61-3,226 
and variation method 234-5 
wave functions 50-2, 62 
Paschen series 156,188-9 
Pauli, W. 195-6,221 
Pauli exclusion principle 221-2, 225, 227 
Pauli spin matrices 200-1,207 
permutation operators 212-16, 219-21 
perturbation theory 239-58,261-2 
degenerate 248-56 

first-order 240-3, 245, 250-4, 257-8, 261 
applied to harmonic oscillator 246-8 
applied to helium atom 257-8 
applied to hydrogen atom 262 
applied to hydrogen atom in electric field 254-6 
applied to a molecule 265-6,276-9 
non-degenerate 239-45 
related to variation method 245 
second-order 240,243-5,261 
applied to spin-orbit coupling 262 
Pfund series 188-9 
phase velocity, see wave packet 
photon 1, 18-19,24-6,30-2, 187 
spin of 217 
Planck, M. 18 
Planck relation 18, 157, 187 
plane wave 2-9, 11, 22, 40 
postulates of quantum mechanics 85-94, 196,217 
principle quantum number, defined 175 
probability density, see wave function - and 
probabilities 

radial distribution function 181, 184-6, 192 
radiation, absorption and emission of 187 
raising operators, see ladder operators 
Rayleigh-Schrodinger perturbation theory, see 
perturbation theory 

reduced mass 149, 158, 175, 188,270-1 
reflection coefficient, see tunneling 
rigid rotor 148-51,274-6,278 
rotational constant 150,275 
Rydberg constant 156-7, 188, 190, 193 
Rydberg potential 279-80 

Schmidt orthogonalization 72-3, 104 
Schrodinger, E. 1,20,37 
Schrodinger equation 

for harmonic oscillator 109, 126 
for hydrogen-like atom 159-61 
for a molecule 264-5 
for particle in a box 48, 61 
time-dependent 37, 59, 85, 92-4 
time-independent 47, 59, 93, 96-7 
Schwarz’s inequality 46, 284 
secular determinant 78, 239, 251-2, 255 
selection rules 192 
series solution method 318-20 


for harmonic oscillator 110, 147-8, 162, 320-3 
for orbital angular momentum 323-6 
for radial equation 326-8 
Slater determinant 221-2 
Sommerfeld, A. 226 

spherical harmonics 139-47, 161, 175, 177, 192, 
274 

spin angular momentum 85,194-207 

of bosons 197-8, 217-18, 221-3, 229-30 
discovery of 194-6 
eigenfunctions 197-9 
eigenvalues 197, 199 

of electron 29, 32-4, 85, 190, 194-6,201-6, 
223-4 

of fermions 197-8, 217-18, 221-9, 231 
gyromagnetic ratio 196 
ladder operators 197, 199 
operators 196-7,199,202-6 
singlet and triplet states 224 
spin one-half 198-201 
see also Pauli spin matrices 
spinor 199 

spin-orbit coupling 201-6,262 
spring constant 107 
square pulse distribution 12-13, 15,35 
Stark effect in atomic hydrogen 254-6, 260-1 
state function 30, 38, 40, 85-6 
see also wave function 
stationary state 47-8, 52, 59, 93-4 
Stern, O.' 19,26 

Stern-Gerlach experiment 26-9,32-4, 195 
symmetric wave function, see wave function 

Thomas precession 202 
Thomson, G. P. 19 

transmission coefficient, see tunneling 
tunneling 53-7, 64 

Uhlenbeck, G. E. 194-6 

uncertainty principle, see Heisenberg uncertainty 
principle 

uncertainty relation, see wave packet - uncertainty 
relation 
Urey, H. 190 

variation method 232-9,260-1 
excited state energies 236-7 
ground state eigenfunctions 234 
ground state energy 232-4 
applied to harmonic oscillator 235-6 
applied to helium atom 259-60 
applied to hydrogen atom in electric field 260-1 
linear variation function 237-9, 261 
applied to particle in a box 234-5 
related to perturbation theory 245 
variation theorem 232-3, 236 
vibration 

of molecular bonds 106 
see also harmonic oscillator 
virial theorem 187,192 
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wave 

frequency of, defined 2-3 
nodes of 2 

see also plane wave, wave motion, wave packet 
wave equation, see Schrodinger equation 
wave function 

collapse of 30-4, 85, 92 
for free particle 20-3, 29-34 
for harmonic oscillator 114-21, 127,320-3 
for hydrogen-like atoms 161, 175 
interpretation of 29-34, 86 
many-particle 60-1,209-25 
in momentum space 40-1,58, 128-9 
normalization 38-41,47, 51, 58, 69, 86, 98 
nuclear 265-8, 273-4, 276 
for orbital angular momentum 138-47, 323-6 
for particle in a box 50-2, 62 
and probabilities 30, 38-41, 47, 58, 60, 86, 91 — 
2, 118-21, 198, 209,222-3 
properties of 29-34, 48-9 
for rigid rotor 150 

symmetry of 209-11,214-21,223-4, 230 
time-dependent 37 
time-independent 47 
well-behaved, defined 68, 162 
see also eigenfunctions, state function, stationary 
state 


wave motion 2-18 
composite wave 4-9 
standing wave 7 

see also plane wave, wave packet 
wave number 

of plane wave, defined 3 
of spectral line 187-8, 190 
wave packet 8—18, 20-2, 29, 34, 36-7, 85 
dispersion of 4-5, 15—18, 20-1, 29, 34 
group velocity of 7, 20-1, 23, 34 
of minimum uncertainty 102-3 
phase velocity of 4-5, 20-1, 34 
uncertainty relation 12, 14-15,18 
see also gaussian distribution, square pulse 
distribution 
wavelength 

de Broglie 19,49-50,58 
of a wave, defined 2 
see also wave motion 

weighting function 69, 72, 74, 77, 162, 164-5 
Young, T. 24 

Young's double-slit experiment 23-6, 29-32 

Zeeman effect 190-2 
Zinn, W. H. 19 



Physical constants 


Speed of light 

c 

2.997 924 58 X 10 8 ms^ 1 
(exact value by definition) 

Elementary charge 

e 

1.602 177 X 10~ 19 C 

Planck’s constant 

h 

6.62608 X 10- 34 Js 


h 

1.054 57 X 10~ 34 Js 

Boltzmann’s constant 


1.380 66 X 10- 23 JKA 1 

Avogadro’s number 

N a 

6.022 14 X 10 23 moH 1 

Mass of electron 

m e 

9.10939 X 10- 31 kg 

Mass of proton 

m p 

1.672 62 X 10- 27 kg 

Mass of neutron 

m n 

1.674 93 X 10- 27 kg 

Permittivity of vacuum 

£ o 

8.854 19 X 10- 12 J- 1 C 2 m-‘ 


4jt£ 0 

1.11265 X 10-‘° J- 1 C 2 m- 1 

Electron spin gyromagnetic ratio g s 

2.002 319 304 

Bohr magneton 

/.Ib = eh/2m e 

9.274 02 X 10- 24 JT-‘ 

^ _ 7 oqt is v in 

Fine structure constant 

Hydrogen atom 

a — e 2 /4ji£ 0 hc 

137.035 99 

Bohr radius 

ao = 4ji£ph 2 / m e e 2 

5.291 77 X 10-“ m 

Rydberg constant 

Rh i 

109 677.32 cm- 1 


Roo = m e e 4 /8fo 2 h 3 c 

109 737.31 cm- 1 

Ground state energy 

Energy conversion 

-13.598 eV 
-2.17864 X 10- 18 J 


1 eV 

= 1.602 177 X 10- 19 J 

— 8065.54 cm 1 

1 J 

fee 6.241 506 X 10 18 eV 

ss 5.034 11 X 10 22 cm- 

1 cm -1 

= 1.239 842 X 10- 4 eV 

= 1.986447 X 10- 23 J 



